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Title: METHOD OF ASSAYING FOR DRUGS WHICH RESTORE 
FUNCTION OF MUTANT CFTR PROTEIN 

BACKGROUND OF THE INVENTION 

Cystic fibrosis is a human genetic disease of 
the secretory epithelia. Although the survival rate 
of those suffering with cystic fibrosis has improved 
in recent years , the median age for patient survival 
is still only about twenty five to thirty years 
despite intensive supportive and prophylactic 
treatment. Today cystic fibrosis remains the most 
common congenital disease among Caucasians, where it 
has a prevalence of about one in two thousand live 
births, and is uniformly fatal. Nearly all patients 
suffering from the disease develop chronic 
progressive disease of the respiratory system, the 
most common cause of death being pulmonary disease. 
Also, in the majority of cases, pancreatic 
dysfunction occurs; hepatobiliary and genitourinary 
diseases are also frequent. Because of the multi- 
system clinical manifestations of the disease, 
current methods of treatment for the disease have 
focused on therapeutic approaches to reduce the 
symptoms of cystic fibrosis. 

United States Patent No. 5,100,647 to Agus, et 
al, discloses a method for treating cystic fibrosis 
by administration of the compound sparteine 
(dodecahydro-7, 14methano-2H, 6H-di-pyrido [1,2-a: 
l , / 2 , -e] [1/5] diazocine), acting as a direct 
exogenous activator of chloride conductants in 
epithelial airways. United States Patent No. 
5,179,001 to Young, et al, discloses a method of 
treating pulmonary complications associated with 
cystic fibrosis caused by the gram negative 
bacterium Pseudomonas aeruginosa . United States 
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Patent No. 4,826,679 to Roy relates to an oral 
composition for alleviating digestive manifestation 
in persons afflicted with cystic fibrosis comprising 
a therapeutic amount of taurine. Despite much 
advancement in the treatment of the symptoms of 
cystic fibrosis very little has been done to 
effectively "cure" the disease at a genetic level. 

One method of gene therapy proposed is United 
States Patent No. 5,14 9,797 disclosing a method of 
site specific alteration of RNA and production of 
encoded polypeptides. This invention is drawn to 
correcting the abnormal mRNA present in individual 
cells, cleaving the mRNA by site directed RNAase 
followed by introduction of the appropriate 
oligoribonucleotide followed by endogenous RNA 
ligase and thus production of a wild-type mRNA 
encoding a normal protein product which then may be 
translated to produce the correct protein. 

Cystic fibrosis is characterized at the genetic 
level by mutations in the gene encoding the cystic 
fibrosis transmembrane conductance regulator (CFTR) 
protein. The CFTR protein is a member of a 
superfamily of structurally related membrane 
proteins named the traffic ATPase or ATP-binding 
cassette (ABC) transporters. Other members of this 
superfamily include the multi-drug resistance (MDR) 
or P-glycoprotein, bovine adenyl cyclase, the yeast 
STE6 protein, as well as several bacterial amino 
acid transport proteins (Riordan, et al f 19 89; Hyde, 
et al, 1990). The principle distinguishing feature 
of this superfamily of proteins is a highly 
conserved nucleotide-binding domain (NBD). 

CFTR is a protein of approximately 14 80 amino 
acids consisting of two repeated motifs, each 
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comprising 6 transmembrane segments and a 
nucleotide-binding domain. The two motifs are 
separated by a large polar R-domain containing 
multiple potential phosphorylation sites. 

Many members of the ABC transport superfamily 
transport small molecules across biological 
membranes in an ATP dependent manner. However, CFTR 
is a CI" channel regulated by phosphorylation and by 
cytosolic ATP (Anderson, et al, "Nucleoside Triphos- 
phates are Required to Open CFTR Chloride Channel" 
Cell, 67, 775-784 ) . The two nucleotide-binding 
domains present in CFTR play a central role in the 
function of CFTR CI"" channels; they interact 
directly with ATP to open the CFTR CI" channel and 
ATP hydrolysis may be required for this effect. The 
importance of the NBD's is further emphasized by the 
large number of cystic fibrosis associated mutations 
that have been found in the domains. The most 
common cystic fibrosis associated mutation, 
accounting for approximately 68% of the cystic 
fibrosis chromosomes is deletion of a phenylalanine 
at position 508 (AF508) in the middle of NBD1. 

This mutation is characterized by a loss of 
apical membrane Cl~ channel activity in the 
epithelial cells leading to chronic progressive 
disease of the respiratory system. The AF508 
mutation causes loss of apical membrane CI" channel 
activity affecting the CFTR protein in two ways. 
First, the mutation causes defective processing and 
hence mislocalization of the mutant protein. This 
is presumed to occur from xnisf olding of the mutant 
protein so that it fails to exit from the 
endoplasmic reticulum and progress to the Golgi 
complex in the apical membrane. The resulting Cl~ 
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transport defect associated with the AF508 is thus 
largely a consequence of the absence of functional 
CFTR at the plasma membrane. Second the AF508 
mutation affects the function of the Cl" channel. 
Single channel analysis of CFTR AF508 suggests that 
although the mutant protein is functional, it has an 
open state probability (P G ) that is one third that 

of wild-type CFTR. 

Defective trafficking of CFTR AF508 can be 
partially reversed if cells are grown at a lower 
temperature, indicating that mutant protein is still 
functional. When CFTR AF508 is produced at a 
reduced temperature (less than 37°C) some of the 
mutant protein transists the Golgi complex normally 
and is localized correctly in the plasma membrane. 
(See Denning, G.M.; Anderson, M.P.? Amara, J.; 
Marshalt, J.; Smith, A.E.J and Welsh, M.J.); 
-Processing of Mutant CFTR (AF508) is Temperature 
Sensitive." Nature, 358 , 761-764 . Since CFTR AF508 
retains partial function, a possible therapy for 
cystic fibrosis may involve pharmacologic 
intervention designed to correct the processing 
defects so that more CFTR AF508 reaches the plasma 
membrane. Potential pharmacologic intervention 
would represent a non gene-therapy approach for the 
treatment of cystic fibrosis. 

As can be seen, a great need exists in the art 
for methods of treating cystic fibrosis that do not 
concentrate solely on the symptoms produced by the 
disease, but rather treat the disease at a cellular 
level by attempting to restore function to the 
mutant product. In light of the scientific 
potential for pharmacologic treatments, a method for 
identification of new candidate drugs for cystic 
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fibrosis treatment is warranted* This is 
complicated f however , by the fact that a screening 
method to identify treatments that reverse the CFTR 
AF508 CI" transport defect in mammalian cells is 
impractical; large numbers of chemical compounds 
cannot easily be screened for an activity that 
restores the cAMP-stimulated Cl~ transport of mutant 
CFTR. 

It is an object of this invention to provide an 
in vitro method of screening pharmaceutical 
compositions, and further mutations for their 
ability to restore function to defective ABC 
transport proteins resulting from mutations such as 
CFTR AF508, by simple means of a yeast mating assay. 

It is another object of the invention to 
construct chimeric gene sequences which encode the 
yeast STE6 gene and the CFTR gene for screening of 
treatments which will restore function to the mutant 
proteins . 

It is another object of the invention to 
construct chimeric gene sequences which encode the 
yeast STE6 gene and the P-glycoprotein gene (or 
other ABC transporter genes) for screening 
treatments which will inhibit the function of the 
chimeric protein. 

Yet another object of the invention is to 
provide a method for identifying potentially useful 
compounds for further useful study in the treatment 
of disease by means of a yeast mating assay. 

Other objects of the invention will become 
obvious from the description of the invention which 
follows . 
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SUMMARY OF THE INVENTION 

This invention relates to a method for 
screening treatments which restore the function of 
mutant GFTR by simple means of a yeast cell mating 
assay. According to the present invention, chimeric 
genetic sequences which encode the yeast a-factor 
transporter STE6 and CFTR or another member of the 
related superfamily such as P-glycoprotein are 
constructed whereby a segment of the CFTR genetic 
sequence is exchanged for a corresponding segment of 
the STE6 genetic sequence to produce a product which 
will complement a yeast ste6A mutation but is non- 
functional upon introduction of a desired mutation 
into the CFTR sequence. 

The resulting chimeras are then transformed 
into a yeast strain which has the STE6 gene deleted 
(ste6A). The yeast strain is then exposed to the 
desired drug or treatment to be screened and the 
strain is assayed for the presence of restoration of 
mating, which indicates a functional chimera. The 
invention discloses several chimeric sequences 
useful for screening for utility of treatments and 
reversing the effects of the CFTR AF508 mutation 
which is associated with cystic fibrosis. 

DETAILED DESCRIPTION OF THE DRAWINGS 

Figure 1 represents the amino acid sequence 

alignment of NBDl of STE6 and CFTR. 

Figure 2 depicts the STE6-CFTR chimeric 

constructs and their mating efficiency. 

Figure 3 is a graphical depiction of a yeast 

assay according to the present invention which 

presents a potential cystic fibrosis treatment. 
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Figure 4a-g is the gene sequence for chimera 

H4. 

Figure 5a-g is the gene sequence for chimera 

H5. 

Figure 6a-g is the gene sequence for chimera 

H6. 

DETAILED DESCRIPTION OF THE INVENTION 

The STE6 gene of Saccharomyces cerevisiae 
encodes an integral membrane protein that functions 
to transport the peptide a-factor out of the cell 
during mating. (Kuchler and Thorner, 1992) , 
"Functional expression of human MDRl in yeast 
Saccharomyces cerevisiae ." Proc. Natl. Acad. Sch., 
USA89, 2303-2306. Deletion of the STE6 gene from 
yeast (ste6A) results in a sterile phenotype. 
According to the present invention , chimeric STE6 
genes were constructed in which segments of the NBD1 
of STE6 were substituted by the analagous NBD1 
sequences from CFTR. 

Selection for sites of exchange was 
accomplished by comparing the known amino acid 
sequences of CFTR and STE6. The CFTR and STE6 amino 
acid sequences were aligned by use of the "BEST FIT" 
algorithm for the comparison of DNA and amino acid 
sequences obtained from the University of Wisconsin 
and found in the GCG Sequence Analysis Package. 

Figure 1 represents an amino acid sequence 
alignment of the NBD for proteins CFTR and STE6. 
The CFTR amino acid sequence is depicted on top 
illustrating amino acids 442 through 57 8; the 
corresponding analagous STE6 amino acid sequence is 
depicted below illustrating amino acids 376 through 
536. Corresponding amino acids are indicated by 
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solid vertical bars and give an indication of 
potential junctions for the genetic constructs. 

Table 1 indicates the portion of CFTR amino 
acids sequence inserted into STE6 for the different 

hybridS ' TABLE 1 

PORTION OF CFTR 
»vm,Tn TTYENTIFIER AMTNO AC TP SEQUENCE 

„, D443-Y577 
H7 D443-F508 

F494-F508 

" 4 F494-I546 

F494-L558 

JJg F494-Y577 

The HI STE6-CFTR chimeric gene was constructed 
with a DNA sequence coding for the entire NBD1 of 
STE6 (from N377 to A535 ) replaced by the 
corresponding region of CFTR (D443 to Y557 ) . (See 
Figure 1.) H2 replaced the amino terminal half of 
NBD and H6 replaced the carboxy terminal half. The 
remaining constructs were variations between these 
two extremes. (See Figure 2. ) The construction of 
plasmids with these sequences used methods which are 
well known to those of skill in the art however the 
following description is included merely for 
purposes of illustration and is not intended to 
limit in any way the invention. 

PT.ASMID CONSTRUCTIONS 
Plasmid RFG416 (a gift from Rick Gaber, 
Northwestern University) is a single copy CEN 
plasmid with the selectable marker URA3 and the 
pUC19 polylinker region. A 6.5 kb Sall-Sacl 
fragment containing the STE6 gene was subcloned from 
STE6-2U (a gift from John McGrath, Massachusetts 
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institute of Technology) into the Sail and Sacl 
sites of the vector RFG416 to produce plasmrd JTS6. 
The yeast TRPl gene (on a 0.85 kb BgLll-EcoRl DNA 
fragment) was inserted at nucleotide position 1356 
of STE6 NBDl, resulting in JTS6T. 

To construct the Hi STE6-CFTR hybrid plasmrd, a 
441 bp DNA fragment containing the CFTR NBDl region 
was synthesized using two STE6-CFTR oligonucleotide 
primers (primer 1, 5 • CCTTCGGAAGC AGTCC^GAAAGATAT- 3 ' ; 
primer 2, 5 ' -GATGAACAATATCTAGGTATCCAAAA-3 ' ) and CFTR 
cDNA template DNA in a polymerase chain reaction 
(PCR) (Ho, et al., 1989). PCR reactions were 
performed with a Temp-Tronic thermal cycler 
(Barnstead/Thermolyne). Oligonucleotide primers 1 
and 2 encoded fusion junctions of STE6 L375 to CFTR 
K442 and CFTR 1578 to STE6 D537, respectively, found 
in the HI STE6-CFTR hybrid gene. The CFTR portion 
of each primer is underlined. STE6 DNA flanking NBDl 
was added to each end of the 441 bp fragment by PCR, 
resulting in a 867 bp DNA fragment consisting of 419 
bp of CFTR DNA (encoding CFTR amino acids K442-L578) 
that is flanked at the 5' end by 168 bp of STE6 DNA 
(encoding STE6 amino acids K319-L375) and 280 bp of 
STE6 DNA at the 3' end (encoding STE6 amino adds 
D537-G640). Plasmid construction of HI was 
performed by cotrans formation (Ito, et al, 1983) of 
yeast strain JPY201 with the 867 bp DNA fragment and 
3 ug of plasmid JTS6T and selection of trans- 
formants on SD-URA (yeast nitrogen base supplemented 
with all amino acids except uracil). Homologous 
recombination between the STE6 DNA sequences at each 
end of the 867 bp DNA fragment with the STE6 gene on 
the plasmid results in the targeted integration 
(Orr-Weaver, et al, 1981) of the CFTR sequences into 
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NBD1 of STE6 on plasmid JTS6T and the consequent 
loss of the TRP1 gene. Recombinants containing the 
desired STE6-CFTR hybrid gene were identified as trp 
auxoptrophs at a frequency of about 1% among the 
transf ormants . Plasmid DNA was prepared from trp 
transf ormants and the structure of the STE6-CFTR 
gene was confirmed by DNA sequencing analysis. 
STE6-CFTR hybrid genes H2-H6 were similarly 
constructed using the following oligonucleotide 
primers for constructing the appropriate STE6CFTR 
junctions; STE6 Q440/CFTR F494, 5 f - 
CACCGTCGTAGAA CAGTTTTCCTGGATTA -3 ' ; CFTR G509/STE6 
S457, 5 ' -CCGAATCTGTTG AACCAAAGATGATATTT -3 ' : CFTR 
G550/STE6 G509, 5 ' -TTGTTGTTGCCCGCC ACTCAGT3TGATTC -3 ' : 
CFTR R560/STE6 A519, 5' ATCTCTGATGAATGC TCTTGCTAAA - 
GAAAT- 3 ' . 

The gene sequences for H4, H5, and H6 are 
included in Figures 4-6. 

The resulting plasmids were then transformed 
into a yeast strain with the STE6 gene deleted 
(ste6A) to test for complementation by means of a 
yeast mating assay. 

YEAST CELL MATING ASSAY 
Several types of variations of assays may be 
used and are commonly known to those of skill in the 
art. The following description is not intended to 
limit in any way the invention. 

The yeast strains JPY201 (MATa ste6A:HIS3 f 
gal2, ura3-52, lys2-801, trpl, leu2-3,112, his3 
A200) and 22-2D (MATa, ura352, leu2-3 , 112 , trpl ) were 
used for all mating experiments. JPY201 contains a 
STE6 deletion (including NBD1 and extending beyond 
the termination codon) and replacement with the 
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yeast HIS3 gene (McGrath and Varshavskiy, 1989). 
Quantitative mating assays were performed as in 
Trueheart, et al (1987). 

Transf ormants of JPY201 containing each STE6- 
CFTR chimera were grown to log phase in 0.1% glucose 
SD-URA media. From each culture, 3 x 10 6 cells were 
mixed with an equal number of 22-2D cells grown in 
YPD media and collected by filtration onto a 
Millipore filter, which was then placed upon a YPD 
plate for 4 hours at 30°C. Cells were resuspended, 
sonicated briefly, and plated from serial dilutions 
onto SD+LEU/TRP (yeast nitrogen base supplemented 
with leucine and tryptophan) . Diploid colonies were 
counted after 3 days at 30°C. Control strains STE6 
wild-type and ste6A consisted of JPY201 transformed 
with plasmids JTS6 and JTS6T, respectively. For 
qualitative petri dish mating assays, JPY201 
transf ormants containing STE6-CFTR chimeras were 
grown as patches on SD-URA media and then replica 
printed to YPD media on a lawn of 22-2D cells. 
Following incubation at 30°C for 8 hr, the plate 
containing the mating cells was replica printed to 
SD-LEU/TRP and incubated 3 days at 30°C to allow 
growth of diploid colonies. 

The complementation results are given in Figure 
2. As can be seen yeast transf ormants containing 
the HI sequence were unable to complement the ste6A 
mutation, indicating a nonfunctional STE6 chimera. 
Similarly, the H2 sequence replacing the amino half 
only was nonfunctional. See Figures 1 and 2- These 
results suggest that the amino terminal region of 
NBD1 from CFTR cannot substitute for that of STE6. 

In contrast, the STE6-CFTR chimeras containing 
the central part of NBD1 from CFTR (H3 and H4 , See 
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Figures 1 and 2) maintained a-f actor transport 
activity comparable with the wild-type STE6 yeast 
strain. This suggests that amino acid residues 
between the conserved Walker A and B motifs (Walker, 
et al, 1982) although different in the NBD1 of CFTR 
and STE6 provide a similar overall structure and 
function. When the chimeras included progressively 
larger segments of the carboxy-terminal region of 
NBD1 from CFTR (H5 and H6 ) mating efficiency was 
reduced. Yeast transformed with the hybrid gene H5 
which contained the substitution of 74 residues of 
STE6 NBD1 (from R441 to 1516) by that from CFTR 
(from F494 to L558) mated at 12% efficiency of wild- 
type STE6. Mating efficiency was further reduced in 
H6 to background levels. However, a low level of 
mating was observed when the H6 gene construct was 
expressed on a high copy number plasmid. 

INTRODUCTION OF THE AF508 MUTATION 
A previous study has suggested that missense 
mutations in STE6 that were analogous to CF- 
associated mutations in NBDl of CFTR result in 
defective a-f actor transport, however, single amino 
acid deletions analogous to AF508 within the central 
region of STE6 NBDl had no effect on the STE6 
function. (Berkower and Michaelis, "Mutational 
Analysis of the Yeast a-f actor Transporter STE6; a 
Member of the ATP Binding Cassette (ABC Protein 
Superfamily ) , " EMBO J 10, 3777-3785 (1991)). 

Contrary to this finding, it was found that by 
incorporating CFTR into NBDl of the STE6 gene and 
also including the AF508 mutation within the CFTR 
segment, function of the STE6 chimeric gene was 
inhibited. The chimeric sequence constructs H3, H4, 
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*te6A yeast strain were also used to assess the 
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1, CPTH a^no acid F494 . I546; 



I ^^S94/04379 

\V0 94/25607 



14- 

5 CFTR amino acid 
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yeast strain to the desired treatment may be used 
including modifying external conditions such as 
temperature . 

The mating plate is then incubated for 24 hours 
to allow yeast cell mating to occur. If a compound 
corrects the molecular defect associated with 
H5-AF508 transporter then a-factor is transported 
allowing mating between the two strains, resulting 
in diploid cells. The mating plate is then replica 
printed to a selective media plate to allow growth 
of diploid colonies. 

For example strain A could be - MATa, 
STE6 : :EIS3, ura3-52 , lys2 , trpl-298 , his3-200, leu2- 
3,112 <plasmid H5-AF508, URA3, CEN>; strain B - 
MATa, ura3-52, trpl, leu2-3,112. 

The selective media used would then be lacking 
lysine and uracil. Consequently strains A and B 
could not grow on this media (strain A has the lys2 
mutation and strain B has the ura3-52 mutation; 
these mutations prevent them from going on media 
without lysine and uracil respectively), however 
diploids produced as a result of the fusion between 
strain A and B can grow (diploids contain the 
genomes of both haploids and consequently has the 
complementing URA3 and LYS2 wild-type allele). 

Production of yeast diploid cells in the area 
surrounding the treatment disc would be a strong 
indicator of presence of compound(s) which correct 
the AF508 defect. A number of yeast mating type 
assays may be developed easily by those of skill in 
the art and the foregoing description is not 
intended to limit in any way the invention. 
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F.X AMPLE 1 



♦ interest in CFTR and other trafficking 

ZZ linker sequence of HBD1, postulated to 
"notion *s a transducer of signals 
Hydrophobic domains and the HBD's (Mimura, * L 
LJ It was hypothesized that a possible 
treatment to correct the F508 station =ou!d be 
in ration of the region o £ the KBD polypeptide 
containing F508, with the K553-L558 region The 
only difference between the H5 chimera and the 

himera is 6 amino aoids in the R553-L558 region. 
= L is also not present in the H3 chimera in 

This region is also no v possibly 
„hich the AF508 mutation had no effect. Possi y 
„hl=h the A n the two regions of 

there is an interactio £unct ion (or 

the KBO required ^ ^ 

b0th ' t on ITsld test whether a mutation in the 
invention was useu. effects 
region R553-L558 might suppress the adverse 

4: ^>,~ AF508 in NBDl . 
° £ ; o isolate revertants of the B5-nFB08 mating 
defect, the H5-AFS08 construct was mutagenized in 
the «3-I« B region in vitro by site-spec: ic 
oligonucleotide ™^™~*J£ ^ J sing th e 
Dil ected H nta,enesis by ^ -P ^ 

random amino acid substitutions at sing e codon 
p ositions ----- f. Possible 

-0 ^ position 
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R553-L558. The „,utageni 2 ad DBA was tr.„ , 
yeast, and transform colonies Cere f 

These yeast tran.f tr *nsf ormants anal y2ed . 

yeast transformants each cont^'n^ 
Plasmid with a mutation .* C ° ntained ™ H5-AF508 

in on. „ mut ^ 10 n at ammo acid R553 of CFTR* 

-J-n one case, rr^-j r »-s~ Lf ^ 

~9i« o f \l H t; 5 r o: ut : tion8 withi » «» «h». 

"creased l ting f^"" -«> -suit in 

*=53« TO tant "e r : a n a 0 i: nC r. h0WeVer ' R5 "« - 

dins are shown in t»ki« n 
expressed relative to th. ln . TaMe 2 ' Wlt * results 

containing the B5-A F 508/ R 5«n ! ySaSt 
Plasms „ated at 3, a„ d 32% B5 - AF5 ° 8 ^"« 
-ertant station, r^ 1'"™" ^ 
the mating defect. Th ^ " 

had little effect „„ „ (B5 . R553 " M T OT 

— transformed into yea t To l L 

-ting ef f iciency W 'J£Z ~ - 

yeast containing hs (Ta MaT 

this invention '. " aoc °rdance with 

- e«ect:t;:r;::;:^r — -~ 
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TABLE__2 

. „ E «icie n cy ot Reverants 
relative Mating ^ E « ic iency 




H5-R553M 
B5 _^F508/R553M 

H5 _AF508/R553Q 

H5 ' AF508 • es were d*~^ ^^SS^" 

mating assays a %) # Data 
relative to B= \ 

d R553Q 

TESTIS OF TK^^XSSB 



MOT£SIO£SJE-^ BE ------r^lOT B which 

TO d^S^S-.^J^ a lso correct the 
! „ the AF508 mutation w ° a * d R55 3M 

r^t In -an *«•-. £^0. 

-rr «trte: ^ - ions 

kalian cells -« ^ revett ant . 

struct* to ^»7 e£ect in c^-re,ulate d 

would correc te sted for 

transport of CFTR * R553M we re tes 

The revertants R55^v ^ and the channel 
on luislocalrzatio rta nts 
their effects on ide ntxfred rev 

-thionine P»^ ed by three ^^J^ b v 
CFTR as lat ed protein as mar 

Reduced f uUV ^VOOSV^d P ^ 
P ele ctrephoretic rf the mutant 

•* increased the app determined 
ge i ; xt xner roerob rane as de x 

protein in th« P 1 it produced fun 

Uunocytochemxcally, 
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channels in the plasma membrane when measured by the 
SPQ halide efflux assay. These results demonstrate 
the functional utility of the yeast model system for 
identifying revertants of CF-associated mutations of 
the NBD1 of CFTR and its use in drug screening for 
pharmacologic compounds that correct the mating 
defect. The mutations discovered could be used to 
identify peptides which may be of utility in 
reversing the effects of the AF508 mutation. 

EXAMPLE 3 

R553Q AND R553M SUPPRESS THE 
CFTR AF5 08 ANION TRANSPORT DEFECT 

The effect of the R553Q and R553M mutations on 
CFTR function was determined by a cAMP-stimulated 
halide efflux using the halide-sensitive fluorophore 
6-methoxy-N- ( 3-sulf opropyl ) -quinolinium ( SPQ ) 
(Illsley and Verkman, 1987). Expression of CFTR cDNA 
containing either the R553Q or the R553M mutation 
alone (without the AF508 mutation) in HeLa cells 
generated cAMP- stimulated halide efflux like wild- 
type CFTR. Cells expressing the AF508 allele in 
this recombinant system showed little, if any, cAMP- 
stimulated halide efflux. However , when the 
mutations R553Q and R553M were introduced into CFTR 
AF508 (CFTR AF508/R553Q and CFTR AF508/R553M, 
respectively), cAMP-dependent anion permeability was 
restored. These results indicate that the CFTR CI" 
channel defect observed with the AF508 mutant could 
be suppressed by either R553 mutation. 

CI" transport by CFTR AF508 containing the 
R553Q and R553M mutations would be detected only if 
the processing defect of CFTR AF508 was suppressed. 
To determine further whether the suppressor 
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mutations correct the processing defect associated 
with the AF508 mutation, the glycosylation patterns 
of CFTR and the various mutants expressed in HeLa 
cells were examined. CFTR is a glycoprotein that 
undergoes progressive glycosylation, resulting in 
three bands that migrate at different rates on an 
SDS-polyacrylamide gel. Band A is the most rapidly 
migrating and represents the nascent, unglycosylated 
protein; band B has an intermediate rate of 
migration and a pattern of core glycosylation 
consistent with processing in the endoplasmic 
reticulum; band C migrates most slowly and has a 
pattern of mature glycosylation consistent with 
processing in the Golgi complex. CFTR AF508 was 
only presented as the unglycosylated band A and the 
core glycosylated band B protein, consistent with 
its failure to traverse the Golgi complex and reach 
the plasma membrane. 

We first examined the glycosylation state of 
CFTR in HeLa cells expressing CFTR cDNA containing 
either the R553Q and R553M mutations alone (without 
the AF508 mutation). Band C was present in cells 
expressing wild- type CFTR and also mutant CFTR 
containing either the R553Q or R553M mutation. In 
contrast, only bands A and B were present in cells 
expressing CFTR AF508. Thus, the R553Q and R553M 
mutations alone do not affect the glycosylation of 
CFTR. 

In cells transfected with the CFTR 
AF508/R553M, a small increase in band C was 
detectable as compared with CFTR AF508. This result 
was observed in three separate experiments. Band C 
was not able to be consistently detected in cells 
expressing CFTR AF508/R553Q possibly owing to 
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limitations in the sensitivity of the assay. 
However, a detectable increase in Band C with CFTR 
AF508/R553M as compared with CFTR AF508 occurred. 

To assess further the ability of suppressor 
mutations to correct the mislocalization defect of 
CFTR AF508, immunocytochemistry was used to detect 
CFTR at the cell surface in HeLa cells expressing 
wild-type or mutant CFTR. An antibody specific to an 
external epitope in the first extracellular loop of 
CFTR was used. (Denning, et al, 1992b). Wild-type 
CFTR is generally detected at the surface of 
unpermeabilized HeLa cells. In contrast, no CFTR 
was detectable at the surface of cells expressing 
CFTR AF508. However, when cells expressed CFTR 
AF508/R553M, CFTR was detected at the plasma 
membrane. For CFTR AF508/R553Q, plasma membrane 
staining of CFTR was weak and variable and could not 
be demonstrated consistently. Only nonspecific 
staining was observed with preimmune serum or in the 
absence of CFTR. These results were consistent with 
the observation that more CFTR AF508/R553M than CFTR 
AF508/R553Q was found in the band C form and suggest 
that the amount of CFTR AF508/R553Q at the plasma 
membrane is exceedingly low. 

RTNKT.F— CHANNEL ANALYSIS OF CFTRAF508 /R553Q CFTR 
To determine whether the suppressor mutations 
altered the single-channel properties of CFTR AF508, 
the revertant mutant CFTR AF508/R553Q was expressed 
in HeLa cells and analyzed with the patch-clamp 
technique. Single-channel analysis of CFTR AF508 
indicated that the P Q of the CFTR AF508 Cl~ channel 
was reduced as compared with wild- type CFTR. 
Denning, et al , (1992a) reported that CFTR AF508 had 



WO 94/25607 



PCT/US94/04379 



•22- 

a P Q of 0.13 + 0.01 (after incubation at reduced 
temperature), compared with a value of 0.34 ± 0.02 
for wild-type, and Dalemans , et al , (1991) reported 
that the P D of CFTR AF508 Cl" channels ranged from 
0.05 to 0.10, compared with a range of 0.22 to 0.35 

for wild type. 

As expected from the response observed in the 
SPQ studies, CFTR/R553Q and CFTR AF508/R553Q formed 
functional Cl~ channels. Inside-out membrane 
patches from either mutant showed no CI" channel 
activity under basal conditions. However, addition 
of protein kinase A(PKA) and ATP to the cytosolic 
surface of the membrane patch activated channels in 
both cases; both agents were required to activate 
the channels (n = 6 for each mutant). Single- 
channel events were recorded from +60 mV to -120 mV 
in increments of 20 mV. The channels were Cl" 
selective as indicated by the reversal potentials (E 
= 20 mV:E c i- = 27 mV) . CFTR/R553Q had a single- 
channel slope conductance (from -120 mV to O mV) of 
9.8 ± 0.5 pS (n = 7), and CFTR AF508/R553Q had a 
conductance of 10.2 ± 0.2 pS (n = 11). The current 
measured at -100 mV for CFTR/R553Q (-1.1 ± 0.1 pA, 
n = 6) and CFTR AF508/R553Q (-1.2 ± 0.0 pA, n = 5) 
was not different from the current measured for 
wild-type CFTR (-1.1 + 0.0 pA, n = 7). Thus, these 
data indicate that neither mutation altered the 
single-channel conductive properties of the CFTR Cl~ 
channel, nor did the mutations abolish the 
regulation by PKA. The results also indicate that, 
as predicted by the functional analysis of CFTR 
AF508/R553Q by the SPQ halide efflux assay above, 
CFTR AF508/R553Q is localized in the plasma 
membrane . 
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Following activation with PKA and ATP, we 
measured the P Q for CFTR/R553Q and CFTR AF508/R553Q 
at different MgATP concentrations. As the 
concentration of MgATP increased, both CFTR/R553Q 
and CFTR AF508/R553Q spent more time in the open 
state. The P Q of CFTR/R553Q CI" channels was 
similar to that previously reported for wild-type 
CFTR CI" channels (Anderson and Welsh, 1992). 
However, at ImM ATP, the P Q of CFTR AF5 08/R553Q 
(0.29 ± 0.02, n = 7) was greater than that of CFTR 
AF508 CI" channels (0.13 + 0.01, n = 4). Thus, the 
R553Q mutation corrected the functional defect in 
gating of the CFTR AF508 CI" channels. 

YEAST MATING ASSAY WITH PLANT EXTRACT SCREENING 
A suspension of yeast cells (strains A and B) 
was spread upon the surface of agar media in a 
petri dish. 

The suspension consisted of equal proportions 
of two haploid yeast strains. Strain A - MATa, 
STE6::HIS3, ura3-52,lys2,trpl-298, his3-200, 
leu2-3,112 <plasmid H5-AF508, URA3, CEN>; strain B 
-MATa, ura3-52, trpl, leu2-3,112. 

Yeast strains A and B are of the a and a 
mating types respectively, and can mate to form 
yeast diploids. Strain A contains the STE6/CFTR 
hybrid a- factor transporter gene H5-AF508 (Teem, et 
al, 1993). The hybrid a-factor transporter encoded 
by the H5-AF508 gene is defective as a result of 
the AF508 mutation. Strain A is therefore unable 
to transport the mating pheromone a-factor 
efficiently. Since the transport of the a-factor 
pheromone is required for yeast cell mating, mating 
between strains A and B is inefficient. 
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The agar media is one rich in nutrients 
(nonselective for the growth of strains A and B) 
necessary for the optimal mating of yeast. 

An extract prepared from the plant genus 
Grindelia was prepared by grinding fresh Grindelia 
leaves in a mixture of ethanol and water (1:1). 
This extract contains the fraction of plant material 
soluble in a solution of 1:1 ethanol and water. 
Small filter paper discs were prepared using a 
standard paper punch and Whatman filter paper #2. A 
filter paper disc containing the Grindelia extract 
was prepared by adding 10 \xl. of extract to the 
disc, and then allowing the disc to air dry. 
Additional extract was added in 10 \il aliquots, 
allowing the disc to dry after each application 
until a total of 30 jil. was added to the disc. The 
disc was then placed upon the surface of the media, 
on top of the yeast cells. 

The mating plate was incubated for 7* hours at 
30°C to allow yeast cell mating to occur. 

Chemical compounds that diffuse out of the 
filter paper disc into the agar will come in contact 
with the mating cells. A component of the Grindelia 
extract promotes mating of strains A and B. 

The mating plate was replica printed to a 
selective media plate that is incubated at 30°C for 
three days to allow the growth of diploid colonies. 

The selective media is lacking lysine and 
uracil. Consequently, strains A and B cannot grow 
on this media (strain A has the lys2 mutation and 
strain B has the ura3-52 mutation: these mutations 
prevent them from growing on media without lysine 
and uracil respectively). However, diploids 
produced as a result of the fusion between strain A 
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and B can grow (diploids contain the genomes of 
both haploids and thus contain the complementing 
URA3 and LYS2 wild-type alleles). Production of 
yeast diploid cells in the area surrounding a leaf 
disc would be a strong indication that the leaf 
disc contained a compound which corrects the AF508 
defect. Diploids cells were easily detected by 
replica printing the yeast cells from the mating 
plate, onto another petri dish plate containing 
media which is selective for the growth of the 
diploid cells. This media will allow the diploid 
yeast cells to grow and form colonies , while 
preventing the growth of the haploid (unmated) 
strain A and B yeast cells. The freguency of 
diploid colonies produced on the selective plate is 
an indirect measurement of the ability of strain A 
to transport a-f actor and mate. 

A high density of yeast colonies on the 
selective media plate occurred in the location 
corresponding to the position where the filter 
paper disc containing the Grindelia extract 
occurred on the mating plate. This is clearly 
distinguishable from the low level of diploid 
colonies which are produced by the mating of the 
two yeast strains in the absence of a chemical 
compound that corrects the AF508 defect. (See 
Figure 3 . ) 

As can be seen in Figure 3 the extract 
reverses the effect of the AF508 mutation 
indicating its potential for further study. 

Thus it can be seen the invention accomplishes 
all of its objectives. 



WO 94/25607 



•CT/US94/04379 



26 - 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

,1) APK.1C.T1 0»i».r.ity of I~. Fo^ion. 

(iii) NUMBER OF SEQUENCES: 6 

(i v) CORRESPONDENCE ADDRESS: Thomte , Voorhees & Sease 

K / A ) ADDRESSEE: Zarley, McKee, in °™ • 

B» STREET : 801 Grand Ave. Suite 3200 

(C) CITY: Des Moines 

(D) STATE: Iowa 

(E) COUNTRY: US 

(F) ZIP- 50309 

( v) COMPUTER READABLE FORM 

1 (A) MEDIUM TYPE: Floppy disk 

B COMPUTER: IBM PC compatible 

C OPERATING SYSTEM: ^-DOS/MS DOS ^ ^ 

(D) SOFTWARE: Patentln Release ix. , 

(vii) CURRENT APPLICATION DATA: 

(Al APPLICATION NUMBER: PCT/US94/ 
(B) FILING DATE: 20-APR-1994 

^ S C ^^SS^ 8 : « ",052,207 
(B) FILING DATE: 04-APR-1993 

(Viii) ATTORNEY / AGENT INFORMATION: 
{ ' (A) NAME: Nebel, BexdiS. 

a REGISTRATION NUMBER: 37,719 
( ( C) MFERENCE/DOCKET NUMBER: Uirf 93-50 

fix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 515-288-3667 

(B) TELEFAX: 515-288-1338 

(2) INFORMATION FOR SEQ ID NO: 1: 

tL) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 384 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3840 

(ix) FEATURE: 

(A) NAME /KEY: misc_feature 

(B) LOCATION: 1317.. 1488 

(D) OTHER INFORMATION: /note= "Substituted analogous 
sequence from human CFTR gene" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG AAC TTT TTA AGT TTT AAG ACT ACA AAA CAC TAT CAC ATT TTC AGG 4 8 

Met Asn Phe Leu Ser Phe Lys Thr Thr Lys His Tyr His lie Phe Arg 
15 10 15 

TAC GTG AAC ATA CGG AAT GAC TAC AGG CTG TTA ATG ATA ATG ATA ATA 96 
Tyr Val Asn lie Arg Asn Asp Tyr Arg Leu Leu Met lie Met lie lie 
20 ~ 25 30 

GGT ACC GTG GCA ACA GGC CTA GTG CCG GCA ATT ACT TCT ATC CTG ACG 14 4 

Gly Thr Val Ala Thr Gly Leu Val Pro Ala He Thr Ser He Leu Thr 
35 40 45 

GGC AGA GTG TTC GAT CTA CTA TCA GTT TTC GTG GCT AAT GGG TCA CAT 192 
Gly Arg Val Phe Asp Leu Leu Ser Val Phe Val Ala Asn Gly Ser His 
50 ^ 55 €0 

CAA GGT TTG TAT TCC CAA CTA GTA CAG AGG TCA ATG GCA GTA ATG GCA 24 0 

Gin Glv Leu Tyr Ser Gin Leu Val Gin Arg Ser Met Ala Val Met Ala 
€5 " 70 75 80 

CTT GGT GCG GCT TCT GTG CCA GTA ATG TGG CTT TCT CTA ACA AGT TGG 288 
Leu Gly Ala Ala Ser Val Pro Val Met Trp Leu Ser Leu Thr Ser Trp 
85 90 95 

ATG CAC ATC GGC GAG AGA CAA GGC TTT AGA ATA CGG TCA CAG ATA TTG 336 
Met His He Gly Glu Arg Gin Gly Phe Arg He Arg Ser Gin He Leu 
100 105 110 
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GAG GCA TAT TTG GAG GAA AAG CCA ATG GAA TGG TAC GAC AAT AAT GAA 384 
Glu Ala Tyr Leu Glu Glu Lys Pro Met Glu Trp Tyr Asp Asn Asn Glu 
115 120 125 

AAA TTG TTA GGA GAT TTT ACT CAA ATC AAC AGA TGT GTG GAA GAG CTA 4 32 

Lys Leu Leu Gly Asp Phe Thr Gin lie Asn Arg Cys Val Glu Glu Leu 
130 * 135 140 

AGA TCA AGC TCC GCA GAG GCA TCA GCC ATA ACT TTC CAG AAT TTA GTT 4 80 

Arq Ser Ser Ser Ala Glu Ala Ser Ala lie Thr Phe Gin Asn Leu Val 
145 150 155 160 

GCA ATA TGT GCG CTT CTG GGG ACG TCA TTC TAC TAT TCT TGG TCA TTA 528 
Ala lie Cys Ala Leu Leu Gly Thr Ser Phe Tyr Tyr Ser Trp Ser Leu 
165 170 175 

ACT TTA ATT ATT CTT TGC AGC TCT CCA ATA ATC ACA TTT TTT GCA GTG 576 
Thr Leu lie He Leu Cys Ser Ser Pro He He Thr Phe Phe Ala Val 
180 185 190 

GTG TTT TCC AGA ATG ATT CAT GTA TAT TCA GAG AAG GAG AAT TCT GAA 624 
Val Phe Ser Arg Met He His Val Tyr Ser Glu Lys Glu Asn Ser Glu 
195 200 205 

ACG AGT AAA GCA GCC CAA TTA CTT ACA TGG TCG ATG AAT GCC GCT CAA 672 
Thr Ser Lys Ala Ala Gin Leu Leu Thr Trp Ser Met Asn Ala Ala Gin 
210 215 220 

TTA GTG AGA TTA TAT TGT ACA CAA CGT CTA GAA AGG AAA AAA TTC AAG 720 
Leu Val Arg Leu Tyr Cys Thr Gin Arg Leu Glu Arg Lys Lys Phe Lys 
225 230 235 240 

GAA ATC ATA CTA AAT TGT AAC ACT TTC TTC ATC AAG AGT TGC TTT TTT 76 8 

Glu He He Leu Asn Cys Asn Thr Phe Phe He Lys Ser Cys Phe Phe 
245 250 255 

GTT GCT GCA AAC GCT GGG ATC TTG AGA TTT TTG ACG TTG ACT ATG TTT 816 
Val Ala Ala Asn Ala Gly He Leu Arg Phe Leu Thr Leu Thr Met Phe 
260 265 270 

GTA CAG GGA TTC TGG TTT GGT TCC GCA ATG ATC AAA AAG GGC AAG CTG 864 
Val Gin Gly Phe Trp Phe Gly Ser Ala Met He Lys Lys Gly Lys Leu 
275 280 285 

AAC ATT AAC GAT GTA ATC ACT TGC TTC CAT TCA TGC ATT ATG CTG GGC 912 
Asn He Asn Asp Val He Thr Cys Phe His Ser Cys He Met Leu Gly 
290 295 300 

TCG ACT TTA AAT AAT ACA TTA CAC CAA ATA GTT GTT CTT CAA AAA GGC 960 
Ser Thr Leu Asn Asn Thr Leu His Gin He Val Val Leu Gin Lys Gly 
305 310 315 320 
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* * S S s ^ w w> S - «• 5 j« e 

3 40 Asn Th ; GCC CAC c », 3*5 

sta tc _ 390 al G ^y Jf A g gc TCA _ 

£ J^ G T7- A _ r GJ y 2? *** re? ,„ 

s - £ ar $ - « T <rc c " as «r us e 
* * ffi s r * «, * * * s «» a 2 ; Arr 
- £ as - ^ Mc r - rhe - § sr s «, ^ <cc 

A r „ AGC Grc „ Tr „ 455 1 Se ? 2V r »f G M TAT «„ 

46 | - v ai - ^ caa CTa ^ ■ i» # « & 

5* GAG MA e ^ Sf flTc „ 

J?e Ji « VaT r Tr GGA r ,, ys «»e 

G? CGGG CAA, 5 ** «5 3? GGA ATC , 

GJ y Gly rf* Caa CAa . 49n ^ GJ V n ACA C7>r , 

^ Gln *«* GTT GCT „ 1S Thr S S GT 

■Act Cca 500 a? S ^ cst GCA 

- s s a p?c ^ GAc 505 AJa ^ s s s ff ^ 

«S ^ ^ A^ C G ^ GO, q 

5, -V. ls - S 

525 Sp Va/ 



^056 
^200 

1392 
1440 
1488 
1536 
1584 
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CRT CGC 
His Ar<3 
530 

Thr Thr 

TAT TTA 
Tyr ^ eU 



AAC CTG 
Asn 



RTC ATA 
lie H e 



^. »iG GCA 
t-TG ATG Ala 

Met LY S A 
ieu Met 5 J 3 5 

rAT gaG 

550 



ATT AGG 
lie Arg 



t tg agc 
Leu Ser 



CAT TGG AGG AAA 

* ivTT GAA TCT 
CAA ciu Set 

Gin H e ° 
555 



GGA AAG 

Giy ^ s 



GAT GAC 

ASP AsP 
ASP 560 



1632 
1680 



TAT TTK 



S £ » S 

565 



GAA GT ^ 

Gl« va *: 

570 



ACC CAG 
Thr Gin 
575 



TCT GAA 
Ser Glu 



CT T CTA 
I,eu Leu 
580 



GCC GAT CC G ACC 
Ala Asp * r ° 



ACT AC A 
Thr Thr 
585 



CTA GAG 
L eu Gin 



AAT GAC 
Asn Asp 
595 



GAA GAA 

ACA CCA 
Thr F*o 
625 

GAT CAG 
Asp Gin 



AAA TCT 
Lys Ser 



n TCT GAT GCG 

tac Tea Ala 
Tyr Ser asp ^ q 

ACT GTG 
His Thr VI 
H3 " 615 



AAA ACT 
Lys Thr 



*rc ACA TGG 

ttt agc a Tr 

phe Ser Thr ^ 
m rTA GAT ACA 

* TT „ i asp Thr 
II* Val 605 



TAT CAC 

Tyr ttis 



GAG ACT 
Glu Thr 



GAA AGT 
Glu Ser 



AAA CTT 
Lys L eU 



Giy |.r 



AGT AAT 
Ser Asn 



,. r TCT CAA 
TTT AAC T«- 
Ph e Asn Ser 

620 

„„- «. R T GAT 

S S w 

635 



T xG GAA 
L eu Glu 



GAG ACA 
Glu Thr 
640 



TTG TCC 
l,eu Ser 



S S « S 

645 



R TC TAT 
^ 750 



CAA AAA A f TCG 
Gin M s Ar9 



AAC GTT 
Asn val 
655 



AGA ACA 
Arg Thr 



AGA AGG 

Arq Ar< 3 
* ^ 660 



<n GAA 
GTT AAA GTX 

Val ltf s Va 



GAG GAA 
Glu Glu 
665 



AAT ATT 
Asn lie 



GGG TAT 
GlY Tyr 
* 670 



GCA CTA 
Ala Leu 



AAA CAA 
Lys Gin 



caa aag 

Gin LY S 
675 



AAC ACC 
Asn Thr 



GAA AGT 
Gin Ser 
680 



TCA ACA 
Ser Thr 



GGG CCA 
ciy * r ° 



CAA- CTT 
Gin *>eu 
685 



CTG AGC 
Leu Ser 



ATT ATT 

lie lie 
690 

ATT CTA 
lie L« u 
705 

CCC GTC 
pro val 



CAG ATT 
Gin U e 



ATC AAA 
lie LY S 



AGA ATG 
695 



ATT AAA 
lie Lys 



AGC ATA 

S er I!* 
700 



AGA TAC 
Arg Tyr 



AAA AAA 
Lys ltf s 



ATC TTG 
lie Leu 



GGA CTG 
Giy 

710 



CTA TGT 
Leu Cys 



TCT CTT 
Ser Leu 



ATC GCA 
lie Ala 
715 



GGC GCC 

ciy 



ACA AAT 
Thr Asn 
120 



rrr tca 

Phe Ser 



Tyr Thr 

725 



T TC AGT 
Phe Ser 



TTC TTA 



CTA GAA 
Leu Gin 



GGA ATT 

Giy Ile 



GTC CCA 
val 
735 



1728 
1776 
1824 
1872 
1920 
1968 
2016 
2064 

211 2 
2160 
2208 
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TCC ACG GAT GGA AAA ACT GGC TCT TCA CAT TAT TTG GCG AAA TGG TCG 2256 
Ser Thr Asp Gly Lys Thr Gly Ser Ser His Tyr Leu Ala Lys Trp Ser 
740 745 750 

CTT CTT GTT CTT GGT GTG GCT GCG GCA GAT GGT ATT TTC AAT TTT GCT 2 304 

Leu Leu Val Leu Gly Val Ala Ala Ala Asp Gly He Phe Asn Phe Ala 
755 760 765 

AAA GGA TTC CTA TTA GAT TGC TGC AGT GAA TAC TGG GTT ATG GAT CTT 2352 
Lys Gly Phe Leu Leu Asp Cys Cys Ser Glu Tyr Trp Val Met Asp Leu 
770 775 7 8 o 

AGA AAT GAA GTT ATG GAA AAA CTG ACG AG A AAG AAT ATG GAC TGG TTT 24 00 

Arg Asn Glu Val Met Glu Lys Leu Thr Arg Lys Asn Met Asp Trp Phe 
785 790 795 800 

TCT GGT GAA AAC AAC AAG GCT TCT GAA ATT TCT GCT CTA GTC TTG AAT 2448 
Ser Gly Glu Asn Asn Lys Ala Ser Glu He Ser Ala Leu Val Leu Asn 
805 810 815 

GAT TTG CGA GAT TTG AGG TCT TTG GTC TCT GAA TTT TTG AGT GCA ATG 24 96 

Asp Leu Arg Asp Leu Arg Ser Leu Val Ser Glu Phe Leu Ser Ala Met 
820 825 830 

ACT AGT TTC GTT ACC GTA TCA ACG ATT GGA CTA ATT TGG GCG TTA GTA 2544 
Thr Ser Phe Val Thr Val Ser Thr He Gly Leu He Trp Ala Leu Val 
835 840 845 

TCG GGC TGG AAG TTA AGT TTG GTT TGT ATT TCG ATG TTT CCA CTC ATA 2592 
Ser Gly Trp Lys Leu Ser Leu Val Cys He Ser Met Phe Pro Leu He 
850 855 860 

ATT ATA TTT TCA GCA ATA TAT GGA GGC ATT TTA CAA AAA TGC GAA ACA 264 0 

He He Phe Ser Ala He Tyr Gly Gly He Leu Gin Lys Cys Glu Thr 
865 870 875 880 

GAT TAT AAG ACA TCT GTT GCT CAG TTA GAA AAC TGC CTG. TAC CAG ATT 26 88 

Asp Tyr Lys Thr Ser Val Ala Gin Leu Glu Asn Cys Leu Tyr Gin He 
885 890 895 

GTC ACT AAC ATT AAA ACC ATT AAG TGC TTA CAA GCT GAA TTT CAT TTT 2736 
Val Thr Asn He Lys Thr He Lys Cys Leu Gin Ala Glu Phe His Phe 
900 90S 910 

CAA TTG ACC TAC CAT GAC TTG AAG ATA AAA ATG CAA CAA ATT GCC TCC 27 84 

Gin Leu Thr Tyr His Asp Leu Lys He Lys Met Gin Gin He Ala Ser 
915 920 925 

AAA CGC GCC ATT GCC ACA GGA TTT GGT ATA TCT ATG ACA AAC ATG ATT 2832 
Lys Arg Ala lie Ala Thr Gly Phe Gly He Ser Met Thr Asn Met He 
930 935 940 



ft 
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Val Met Cys He Gl ^ ^ ^ 

- s s a s s s a « ss " s ~ ~ * " 
- S issssssS ss5l '- ss,u 

Leu Leu Phe ^ 

> CGT GCT GCC ACT TGG ATC TAT ^ ^ 

GAT ATA AGT AGA g «J £ f la Ala Ser Trp He «g 5 
Asp 11© Ser Arg wy l000 

" 5 =M GTT G« «C »» »I S S 

« •» - s T . s s si s: s ... « 

« «s 1915 MC m „ r - » if. 

.-a GCT GGT CAC ACC TAC CAT 6 Gln Lys LyS Pro 

GTG GGA ATA GCT Gfc ^ Tyr Bis Giy 

Val Gly He Ala y io30 gcc 
1025 ACA TTT GCC TAT CCA TCT GCA CCT ^ ^ 

^ TCA ATT CAA AAT TTG ACA TT1 Tyj . pro Ser Ala ^ 

GTT TCA ATI Gij> Leu Thr Phe ^ 

10 " TG MT « -c - ? 25 S S S S 

1 ^ » « « - r s s s s s s 

« g g 5 S5 S «v ~ •» - i- 

™ •» r s s £ " " a «v - » ; r ~ 

Le u Thr Lys ^e u 1095 TCA 3360 

r« * « - - S S K Si S » S£ - So 

ACG GAC GTA ARi Tj . p Asn ^ Thr fi 

Thr Asp Val Asn v 

CTA ACT TAT GGT TTA CAA GAi n# Leu Giu lie u50 

Su Thr Tyr Gly ^u Gl, Asp n45 



2880 
2928 
2976 
3024 
3072 
3120 
3168 
3216 



3264 
3312 



3408 
3456 
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s s s s a c s i ss s a s ss s a 

1155 

„ „ r . , CT TO CTG TCA GGT GGA CAA GCG 
GGC TTG GAT ACA CGT ATT GAT ACA ACT TTA Qly ^ Ala 

Gly Leu Asp Thr Arg lie **P™ T 1180 

as is s s s a s = = i ^ - » = s. 

sis5=s====a= sss a s 
s a s s s ss s s ss «• = = s= S 

1220 

SSBSSSSS5S5SS5S- 

1235 

„ rrT TTC GAC ACT TTA TAT 

S £ S T , " £ 5S %p S 5 2. s - - - - 

sssssssssssssss- 

1265 iZ 



(2) INFORMATION FOR SEQ ID N0:2: 

„, SEOUENCE CHARACTERISTICS t 

11 ? A, LENGTH: 1279 amino adds 

(B| TYPE: amino acid 

(D) TOPOLOGX: linear 

(ii) MOLECULE TYPE: protein 

, xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

« J. L - ~ ~ «. * - «a ~ - - «s « 

^ v.! .... n. « « « ^ «g - - - ue "S "* 

01y _ T, ». «, - « » - ~ s :i u * ~ Tht 



3696 



WO 94/25607 



PCT/US94/04379 



- 34 - 



Gly Arg val Phe Asp Leu Leu Ser Val Phe Val Ala Asn CI, Ser His 

50 55 
Gin Gly Leu Tyr Ser Gin Leu Val Gin Arg Ser Het Ala Val Met Ala 

Leu Gly Ala Ala Ser Val Pro Val Met Trp Leu Ser Leu Thr Ser Trp 

85 yo 
Met His lie Gly Glu Arg Gin Gly Phe Arg He Arg Ser Gin lie Leu 

100 105 
Glu Ala Tyr Leu Glu Glu Lys Pro Met Glu Trp Tyr Asp Asn Asn Glu 

Lys Leu Leu Gly Asp Phe Thr Gin He Asn Arg Cys Val Glu Glu Leu 

130 135 
Arg Ser Ser Ser Ala Glu Ala Ser Ala lie Thr Phe Gin Asn Leu Val 
145 150 I" 

Ala lie Cys Ala Leu Leu Gly Thr Ser Phe Tyr Tyr Ser Trp Ser Leu 

165 170 
Thr Leu He He Leu Cys Ser Ser Pro He He Thr Phe Ph. Ala Val 

180 185 
Val Phe Ser Arg Met He His Val Tyr Ser Glu Lys Glu Asn Ser Glu 



195 200 



Thr Ser Lys Ala Ala Gin Leu Leu Thr Trp Ser Met Asn Ala Ala Gin 



210 



Leu Val Arg Leu Tyr Cys Thr Gin Arg Leu Glu Arg Lys Lys Phe I*. 



225 



230 



Glu He He Leu Asn Cys Asn Thr Phe Phe He Lys Ser Cys Phe Phe 

245 250 
Val Ala Ala Asn Ala Gly lie Leu Arg Phe Leu Thr Leu Thr Met Phe 



260 265 



Val Gin Gly Phe Trp Phe Gly Ser Ala Met He Lys Lys Gly Lys Leu 

280 ZOJ 



275 



Asn He Asn Asp Val He Thr Cys Phe His Ser Cys He Met Leu Gly 



290 



295 



Ser Thr Leu Asn Asn Thr Leu His Gin He Val Val Leu Gin Lys Gly 



305 



310 
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Tie Met Thr Leu Leu Lys Asp Gly Mr Lys 
Gly v.l Ala Met Glu Lys He Met ^ 

. r1 „ Phe pro Leu Asp Tyr 
Teu Asn Lys Thr Val Ala His Gin Phe 
Arq Asn Pro Leu Asn uy 345 

34 <u.r Tvr Pro Ser 

Teu Thr Phe Ala Asn Val Ser Phe Ser Tyr 
Ma Thr Ser Asp Leu Thr vn ^ 

355 , val Ser Leu Asn Phe Ser Ala 

Arg pro ser Glu Ala Val Leu Lys Asn Val ^ 

h xle Val Gly W Ser Gly Ser Gly Lys Ser Thr 
6ly Gin Phe Thr Phe II. Val Gly 3g5 

385 _ Glv Tyr Asn Gly Ser He 

u. s« - - 5s - " 9 ~ " S 

Glv ,i, ». ii. *» M " Gln if. w 111 

Ser He Asn Gly His Asn 425 

«u <5er Trp He Met Pro Gly Thr 
Tle Thr Val Val Glu Gin Phe Ser Trp ^ 
Glu Asn He Tnr va 44Q 

435 „ s„r Tvr Asp Glu Tyr Arg Tyr 

Glu Asn He He Phe Gly Val Ser Tyr A P 
lie Lys Glu Asn a 455 

Ma CVS Gin Leu Glu Glu Asp He Ser Lys Phe 
^ Ser Val He Lys Ala Cys Gin ^ 

Asn He Val Leu Gly Glu Gly Gly He Thr Leu Ser 
Ala Glu Lys Asp Asn He 490 

Gly Gly Gin Gin Gin Arg ^ 

Thr Pro He Leu rne 520 

515 Ala He Arg His Trp Arg Lys Gly Lys 

His Arg Asn Leu Leu Met Lys Ala He Arg ^ 

i Thr His Glu Leu Ser Gin He Glu Ser Asp Asp 
Thr Thr He He Leu Thr His 555 

545 i G io Gly Glu Val Val Glu Ser Gly Thr Gin 

Tvr x-u Tyr Leu Met Lys Glu Gly ^ 

Ala Asp pro Thr Thr Thr Phe Ser Thr Trp Tyr His 
Ser Glu Leu Leu Ala Asp Fr ^ 
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Leu Gin Asn Asp Tyr Ser 
595 

Glu Glu Lys Ser He His 
610 

Thr Pro Lys Leu Gly Ser 
625 630 

Asp Gin Leu Ser Phe Tyr 
645 

Arg Thr Arg Arg Val Lys 
660 

Lys Gin Gin Lys Asn Thr 
675 

He He Gin He He Lys 
690 

He Leu He Leu 

705 

Pro Val Phe Ser 



Asp Ala 
600 

Thr Val 
615 



Lys Thr 
Glu Ser 



He Val Asp Thr Glu Thr 
605 



Cys Leu Ser Asn 
Glu Ala 
Val Glu 



He Tyr 
650 



Gly Leu 
710 

Tyr Thr 
725 



Ser Thr 

Leu Leu 

Lys Gly 
770 

Arg Asn 
785 

Ser Gly 
Asp Leu 
Thr Ser 



Ser Gly 
850 



Asp Gly 
740 



Lys Thr 
Gly Val 
Phe Leu Leu Asp 



Val Leu 
755 



Glu Val 
Glu Asn 



Met Glu 
790 

Asn Lys 
805 



Arg Asp 
820 

Phe Val 
835 



Leu Arg 
Thr Val 
Trp Lys Leu Ser 



Glu Ser 
680 

Arg Met 
695 

Leu Cys 

Phe Ser 

Gly Ser 

Ala Ala 
760 

Cys Cys 
775 

Lys Leu 
Ala Ser 
Ser Leu 



Ser Thr 
84 0 

Leu Val 
855 



Glu Glu 
665 

Ser Thr 

He Lys 

Ser Leu 

Phe Leu 
730 

Ser His 
745 

Ala Asp 

Ser Glu 

Thr Arg 

Glu He 
810 

Val Ser 
825 

He Gly 
Cys He 



Phe Asn 
620 

Leu Gly 
635 

Gin Lys 

Asn He 

Gly Pro 

Ser He 
700 

He Ala 
715 

Leu Glu 

Tyr Leu 

Gly He 

Tyr Trp 
780 

Lys Asn 
795 

Ser Ala 
Glu Phe 
Leu He 



Ser Met 
860 



Ser Gin Leu Glu 



Tyr Asp 
Arg Ser 



Gly Tyr 
670 

Gin Leu 
685 



Glu Thr 
640 

Asn Val 
655 

Ala Leu 



Leu Ser 



Arg Tyr Lys Lys 
Gly Ala 
Gly He 



Thr Asn 
720 



Val Pro 
735 



Trp Ser 
Phe Ala 
Val Met Asp Leu 



Ala Lys 
750 

Phe Asn 
765 



Met Asp Trp Phe 
800 

Leu Val Leu Asn 
815 

Leu Ser Ala Met 
830 

Trp Ala Leu Val 
845 

Phe Pro Leu lie 
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J£ «• Phe ser Ala i le Tyr 

«T He Leu Gln Lys Cys ^ ^ 

ASP ^ Thr Ser Val Ala G5 „ , 880 

Gln Leu g „ Asn cys t ^ 

V " T " As " III * - «• cys Leu Gln Ala Giu ph 895 

Gln Leu Thr Tvr * 910 

" P SB -» »« «. - a. „. s„ 
- » - - «• jj, 01y „. ^ » 

935 Thr Asn Met ll e 

2S CyS <*» Al. lie Xle Ty r Tyr „ ! 

950 ^ Ty r Tyr Gly Leu Lys Leu 

Met He His G1 - 960 

" Ly ' 510 SJ - - ~ «- * ^ u 

*" "* S "« - <* - s« ta « s „ „ 

985 X ber G ^n He p ro 

„ 1000 »« te9 !1 * 
m . - * g. ^ v>1 wu „ n s<n 

a. «. «, Bi , TOr ^ Bis " 2 " 

1030 y HiS Glu Ly s Lys » 

Va1 - 1035 * yE Pro J le 

" Ser Gln Asn ^ , 1040 

^ ^V" ^ ^ Thr Ala 

" V " & As " - - «- net Phe cys „ G1 1055 

1065 ys G1 y Gln Thr Le U 

G ly He He Gly Glu Ser ei „ 1070 

- Jjr ^ u. Tyc A „ cys «. 

1095 biy L y g He Lys i ie As 

If5 5 -P V.! s „ 4sp ^ l= 

*" G1 » "« - 

1 Vil M » cl » Pro u. u. p„. , n2 ° 
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r , u He Glu Met Tyt Asp 
t Gin Asp Glu He Leu Glu lie 
L eu Thr Tyr Gly Leu Gln ASP 1145 

• „ SO Phe Va! He Ser Ser Pro Gin 

„> v*\ GlV He His Asp Phe v 
Ala Leu Lys Tyr Val Gly 

I 155 tou Ser Gly GlV Gln Ala 

Tht Arg He Asp Thr Thr Leu Leu Ser^ 
Gly Leu Asp Tht Arg il75 

1170 Ala Leu Leu Arg Lys Ser Lys He * 

Te u Cys He Ala Arg Ala Leu ^ 
Gln Arg Leu Cys gQ 

1185 , h Ser Ala Leu Asp Ser Val Ser Ser Ser^ 

ti- Leu Asp Glu Cys Thr Ser Ala 1JU 

Ile Leu A p 12Q5 ^ Val 

, ,= Giv Pro Pro Ala Leu 
Ile Asn Glu He Val Lys Lys Gly 

Ile Asn i220 Val 

■ , r Glu Gin Met Met Arg Ser Cys Asn Se^ 
He Thr His Ser Glu fcr 1240 

1235 , n=n Phe Asp Thr Leu Tyr 

As* Gly Lys val val Glu Arg Gly Asn Ph- 
Leu Lys Asp Gly "X l255 

1250 k rin He Val Ser Asn Gln Ser Ser 

a™ GlV Glu Leu Phe Gln lie 
Asn Asn Arg Giy ^ l2?0 

1265 



(2) INFORMATION FOR SEQ ID «0:3: 

» S&SSSK s ngle 
(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA (genome) 
(iii.) HYPOTHETICAL: NO 
(i V) ANTI-SENSE: NO 

^ Ta) 1 ofGANfsMfsaccharo.yces cerevisiae 

M ^TSe/KEY: COS 

(B) LOCATION: 1-.384 0 

™ ^NAME/KEY: ^-*l%r 
(B) LOCATION: 1318.. 
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(D) OTHER INFORMATION: /note- "Substituted analogous 
sequence from human CFTR gene" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG AAC TTT TTA ACT TTT AAG ACT ACA AAA CAC TAT CAC ATT TTC AGG 
Met Asn Phe Leu Ser Phe Lys Thr Thr Lys His Tyr His lie Phe Arq 
1 5 10 15 

Tvr vl? J* n ??* ?* T GAC TAC AGG CTG WA ATG ATA ATG ATA ATA 

Tyr Val Asn lie Arg Asn Asp Tyr Arg Leu Leu Met lie Met He He 

20 25 30 

GGT ACC GTG GCA ACA GGC CTA GTG CCG GCA ATT ACT TCT ATC CTG ACG 
Gly Thr Val Ala Thr Gly Leu Val Pro Ala lie Thr Ser lie Leu Thr 
35 40 45 

GGC AGA GTG TTC GAT CTA CTA TCA GTT TTC GTG GCT AAT GGG TCP CAT 
Gly Arg Val Phe Asp Leu Leu Ser Val Phe Val Ala Asn Gly Ser SI 
50 55 60 * 

CAA GGT TTG TAT TCC CAA CTA GTA CAG AGG TCA ATG GCA GTA ATG GCA 
Gin Gly Leu Tyr Ser Gin Leu Val Gin Arg Ser Met Ala vll Met Sa 
65 70 75 80 

l2 r?3 G ? G G ? T l CT 2*? CCA CTA ATG TGG OT TCT CTA ACA ACT TGG 
Leu Gly Ala Ala Ser Val Pro Val Met Trp Leu Ser Leu Thr Ser Trp 
85 90 95 

Met h? C t? C r GC GAG AGA CAA GGC 717 AGA ATA GGG TCA CAG ATA TTG 
Met His lie Gly Glu Arg Gin Gly Phe Arg He Arg Ser Gin lie Leu 

100 105 no 

r AG GCA « AT P G GAG GAA AAG CCA AT G GAA TGG TAC GAC AAT AAT GAA 
Glu Ala Tyr Leu Glu Glu Lys Pro Met Glu Trp Tyr Asp ten ten Glu 

120 125 

AAA TTG TTA GGA GAT TTT ACT CAA ATC AAC AGA TGT GTG GAA GAG CTA 
Lys Leu Leu Gly Asp Phe Thr Gin lie Asn Arg Cvs Val Glu Glu Leu 
1JJU 135 !4 0 

AGA TCA AGC TCC GCA GAG GCA TCA GCC ATA ACT TTC CAG AAT TTA GTT 
Arg Ser Ser Ser Ala Glu Ala Ser Ala lie Thr Phe Gin isn £ Val 
145 150 155 160 

GCA ATA TGT GCG CTT CTG GGG ACG TCA TTC TAC TAT TCT TGG TCA TTA 
Ala lie Cys Ala Leu Leu Gly Thr Ser Phe Tyr Tyr Ser Trp Ser Leu 
i65 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



4 32 



480 



528 
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-~ss=ss=S BSS?s « s5S 

GT A AT - GAG AAC ffi - £ S! 
GTG TTT ** ACA g. -J g S £ Ser «» 

GCC CAA TTA CTT ACA TGG ^ Asn R la Al 

ACG ACT AAA GCA GCC ^ LeU Thr Trp ^ 

Thr Ser I** Rla A1 215 ^ AAA TTC AAG 

210 ^ CGT CTA GAA AGG AAA Lys 

,„ TTA TAT TGT ACA CAA CGT ^ Arg Lys Lys 24Q 

TTA GTG AGA TTA l Thr G m Arg 

Leu Val Arg Leu T V 2 y 30 W TTT 

225 r ^ TTC ATC AAG AGT TS ^ phe 

™ TTA AAT TGT AAC ACT TT u# Lys Ser y ^ 

GAA ATC ATA CTA £ Asn Thr Phe 

Glu He He Leu ^ ^ 

„«—ssffi=3 ssa=,ss 

^ "* - B J, ggt w I - S - S S S S 

ra » CAG 0» «5 S S =«' ^ " =» 5 
IS «» «» " P 2! ° „ r atg ctg ggc 

, act tgc ttc cat tca tgc att b teu Gly 

AAC ATT AAC CAT CTA ATC ACT « „„ Bi . S « C,; 

»„ M . A.n ASP V.1 2 „ OT CAA AAA GCC 

2 " a AAT AAT ACA TTA CAC CAA ATA CTT £ « a „ „ y , C„ 

2 £ S S - SS M " - 

mG GCT ATC GAA AAA ATC « g « £ £ ffi 
S 55 S «« S „ CCA CTA GAT TAT 

" " " ACA TTT GCT « - £ S S 5 ^ ~ 

CCC ACC ACT GAT CTA ACA TTT ... S« „, 

Ala Thr Ser as P 36 o GCA 

3 " G CA GTT TTA AAG AAC GTT AGT TTA £ Ser Ale 

AGA CCT TCG GAA GCA GTT ^ Asn val 38() 

Arg Pro Ser Glu Axe 375 



624 



€■72 



720 



768 

864 
912 
960 
1008 
1056 
1104 
1152 
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m „ R rrr TCA GGT AAA TCT ACA 

sasssBssssg— -s 

385 rr , TRT AAT GGA TCG ATA 

rr « C TTA TTA TTA AGG TTC TAG GAT GGC TAT AA^ ^ ^ 

TTA TCC AAC TTA Tl* * h£ Tyr As p Gly iy* 415 

Leu Ser Asn Leu Leu Leu Arg o0 

450 r , r r *r ATC TCC AAG TTT 

s e - « " ?. - - 2 25 S - "* £ " » 

465 r „ rrT gga ATC ACA CTG AGT 

«. M »» « « g S SSSS S « - - s « 

Ala Glu Lys Asp Asn lie 49Q 

_„ f- C R TTC ATC AGA GAT 

s 5 a s s a s is s s - - s: - ** p 

Thr Pro He Leu rn 520 

515 n«r »rr CAT TGG AGG AAA GGA AAG 

S S K S S S § S S 5 S « - - - 

530 arr CAA ATT GAA TCT GAT GAC 

- - s s s s k its ss s s n. «. ~ - » 

565 ^ *rr ACA TGG TAT CAC 

„ „ « s « « - g g S J2 S - S5 8i ' 

Ser Glu Leu Leu Ala Asp r ^ 
580 
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CTA CAG AAT GAC TAC TCT GAT GCG AAA ACT ATT GTA GAT ACA GAG ACT 
Leu Gin Asn Asp Tyr Ser Asp Ala Lys Thr ^ 
595 600 

s s s: s s s i = = = S 5 ~ =5 - s 

625 630 

S S= E S 5? K Si S ?H - " S 5 K S 

660 665 
AAA CAA CAA AAG AAC ACC GAA AGT TCA ACA GGG C£ CAA CTT CTC AGC 
Lys Gin Gin Lys Asn Thr Glu Ser Ser inr y ^ 
675 680 

s s s ?n if. s s sn s s: = s " - " " 

5 2 S g SS S 5 12 S S S S S S K 

705 710 

CCC GTC TTT TCA TAC ACA TTC AGT TTC TTA CTA GAA GGA ATT GTC CCA 
Pro Val Phe Ser Tyr Thr Phe Ser Phe Leu u J 7 35 

725 730 

TCC ACG GAT GGA AAA ACT GGC TCT TCA CAT TAT TTG GCG AAA TGG TCG 
Ser Thr Asp Gly Lys Thr Gly Ser Ser Bis Tyr Le . 
740 745 
CTT GTT CTT GGT GTG GCT GCG GCA GAT GGT ATT TTC AAT TTT GCT 
Leu Leu Val Leu Gly Val Ala Ala Ala Asp ^ 

GGA TTC CTA TTA GAT TGC TGC AGT GAA TAC TGG GTT ATG GAT CTT 
Lys Gly Phe Leu Leu Asp Cys Cys Ser Glu Tyr 

> M Z «» «. » if "t 2 g 5! i2 S S C P S S 

Arq Asn Glu Val Met Glu Lys Leu Thr Arg i-y 800 
785 790 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 
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TCT GGT GAA AAC AAC AAG GCT TCT GAA ATT TCT GCT CTA GTC TTG AAT 244 8 
Set Gly Glu Asn Asn Lys Ala Ser Glu lie Ser Ala Leu Val Leu Asn 
810 B - 1 - > 



805 



GAT TAT AAG ACA TCT GTT GCT CAG TTA GAA AAC TGC CTG TAG CAG ATT 
Asp Tyr Lys Thr Ser Val Ala Gin Leu Glu Asn Cys Leu Tyr Gin lie 



885 



GTC ACT AAC ATT AAA ACC ATT AAG TGC TTA CAA GCT GAA TTT CAT TTT 
Thr Xsn fie Lys Thr He Lys Cys Leu Gin Ala Glu Phe His Phe 
905 Slu 



900 



CAA TTG ACC TAC CAT GAC TTG AAG ATA AAA ATG CAA CAA ATT GCC TCC 
Gin Leu Thr Tyr His Asp Leu Lys lie Lys Met Gin Gin lie Ala Ser 

920 925 



915 



930 



945 



GAT TTG CGA GAT TTG AGG TCT TTG GTC TCT GAA TTT TTG AGT GCA ATG 2496 
Asp Leu Arg Asp Leu Arg Ser Leu Val Ser Glu Phe Leu Ser Ala Met 
820 825 830 

ACT ACT TTC GTT ACC GTA TCA ACG ATT GGA CTA ATT TGG GCG TTA CTA 2544 
Thr Ser Phe Val Thr Val Ser Thr lie Gly Leu He Trp Ala Leu Val 
835 840 845 

TCG GGC TGG AAG TTA ACT TTG GTT TGT ATT TCG ATG TTT CCA CTC ATA 2592 
Ser Gly Trp Lys Leu Ser Leu Val Cys lie Ser Met Phe Pro Leu He 
850 855 860 

ATT ATA TTT TCA GCA ATA TAT GGA GGC ATT TTA CAA AAA TGC GAA ACA 264 0 

S lie Phe Ser Ala He Tyr Gly Gly lie Leu Gin Lys Cys Glu Thr 
865 870 875 880 



2688 



2736 



2784 



AAA CGC GCC ATT GCC ACA GGA TTT GGT ATA TCT ATG ACA AAC ATG ATT 2832 
Lys Arg Ala He Ala Thr Gly Phe Gly He Ser Met Thr Asn Met lie 

935 940 



GTC ATG TGT ATC CAA GCT ATT ATT TAC TAC TAT GGC CTA AAG CTG GTT 2880 
Val Met Cys He Gin Ala He He Tyr Tyr Tyr Gly Leu Lys Leu Val 
950 955 9 60 



ATG ATT CAC GAG TAC ACC TCA AAG GAA ATG TTT ACG ACT TTC ACT TTG 2928 
Met lie His Glu Tyr Thr Ser Lys Glu Met Phe Thr Thr Phe Thr Leu 
965 970 975 

TTA TTA TTC ACT ATT ATG TCA TGC ACT AGC CTA GTA AGT CAG ATA CCC 2976 
Leu Leu Phe Thr He Met Ser Cys Thr Ser Leu Val Ser Gin He Pro 
980 985 990 

GAT ATA AGT AGA GGC CAA CGT GCT GCC AGT TGG ATC TAT AGG ATT CTT 3024 
Asp lie Ser Arg Gly Gin Arg Ala Ala Ser Trp He Tyr Arg He Leu 
V 995 1000 1005 
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GAT GAA AAG 
Asp Glu Lys 
1010 


CAT 
His 


AAT 
Asn 


ACC 
Thr 


CTA GAG 
Leu Glu 
1015 


GTT 
Val 


GAA 
Glu 


AAC 
Asn 


AAT AAT 
Asn Asn 
1020 


GCT 
Ala 


AGA 
Arg 


ACA 
Thr 


3072 


GTG GGA 
Val Gly 
1025 


ATA 
He 


GCT GGT 
Ala Gly 


CAC ACC 
His Thr 
1030 


TAC 
Tyr 


CAT 
His 


GGC AAA GAA 
Gly Lys Glu 
1035 


AAA 
Lys 


AAA 
Lys 


CCA 
Pro 


ATC 
He 
1040 


3120 


GTT 
Val 


TCA 
Ser 


ATT 
He 


CAA 
Gin 


AAT TTG 
Asn Leu 
1045 


ACA 
Thr 


TTT 
Phe 


GCC 
Ala 


TAT CCA 
Tyr Pro 
1050 


TCT 
Ser 


GCA 
Ala 


CCT 
Pro 


ACC GCC 
Thr Ala 
1055 


3168 


TTT 
Phe 


GTT 
Val 


TAC 
Tyr 


AAA AAC 
Lys Asn 
1060 


ATG 
Met 


AAT 
Asn 


TTT GAC ATG 
Phe Asp Met 
1065 


TTT 
Phe 


TGC 
Cys 


GGA 
Gly 


CAG ACG 
Gin Thr 
1070 


TTA 
Leu 


3216 


GGT 
Gly 


ATC 
He 


ATT GGT 
He Gly 
1075 


GAA 
Glu 


TCA 
Ser 


GGC 
Gly 


ACA GGA 
Thr Gly 
1080 


AAG 
Lys 


TCT 
Ser 


ACA 
Thr 


CTT GTG 
Leu Val 
1085 


CTT 
Leu 


TTA 
Leu 


3264 


TTA 
Leu 


ACA AAA 
Thr Lys 
1090 


CTT 
Leu 


TAT 
Tyr 


AAT 
Asn 


TGT GAA 
Cys Glu 
1095 


GTA 
Val 


GGC 
Gly 


AAA 

Lys 


ATT AAA 
He Lys 
1100 


ATA 
He 


GAC 
Asp 


GGT 
Gly 


3312 


ACG GAC 
Thr Asp 
1105 


GTA 
Val 


AAT 
Asn 


GAC 
Asp 


TGG AAT 
Trp Asn 
1110 


TTG 
Leu 


ACA 
Thr 


AGT 
Ser 


TTA AGA 
Leu Arg 
1115 


AAA 
Lys 


GAA 
Glu 


ATT 

He 


TCA 
Ser 
1120 


3360 


GTG 
Val 


GTT 
Val 


GAG 
Glu 


CAA 
Gin 


AAA CCT 
Lys Pro 
1125 


TTA 
Leu 


TTA 
Leu 


TTC 
Phe 


AAT GGA 
Asn Gly 
1130 


ACC 
Thr 


ATC 
He 


AGA 
Arg 


GAT AAC 
Asp Asn 
1135 


3408 


CTA 
Leu 


ACT 
Thr 


TAT 
Tyr 


GGT TTA 
Gly Leu 
1140 


CAA 
Gin 


GAT 
Asp 


GAA 
Glu 


ATA CTT 
He Leu 
1145 


GAA 
Glu 


ATT 
He 


GAA 
Glu 


ATG TAT 
Met Tyr 
1150 


GAT 
Asp 


3456 


GCA 
Ala 


TTA 
Leu 


AAA TAC 
Lvs Tyr 
1155 


GTA 
Val 


GGA 
Gly 


ATC 
He 


CAT GAC 
His Asp 
1160 


TTT 
Phe 


GTA 

Val 


ATT 
He 


TCA TCA 
Ser Ser 
11^5 


CCT 
Pro 


CAG 
Gin 


3504 


GGC 
Gly 


TTG GAT 
Leu Asp 
1170 


ACA 
Thr 


CGT 
Arg 


ATT 
He 


GAT ACA 
Asp Thr 
1175 


ACT 
Thr 


TTA 
Leu 


CTG 
Leu 


TCA GGT GGA CAA 
Ser Gly Gly Gin 
1180 


GCG 
Ala 


3552 


CAA AGG 
Gin Arg 
1185 


CTT 
Leu 


TGC 
Cys 


ATA 
He 


GCC AGA 
Ala Arg 
1190 


GCA 
Ala 


CTT 
Leu 


CTG 
Leu 


AGA AAA 
Arg Lys 
1195 


TCA AAA 
Ser Lys 


ATT 
He 


CTG 
Leu 
1200 


3600 


ATT 

He 


TTA GAT 
Leu Asp 


GAG 
Glu 


TGT ACT 
Cys Thr 
1205 


TCA 
Ser 


GCC 
Ala 


TTG 
Leu 


GAT TCT 
Asp Ser 
1210 


GTC 
Val 


AGC 
Ser 


TCC 
Ser 


TCT ATC 
Ser He 
1215 


3648 
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JSSSSaSSSBSESSBSBS 
SSSSSSSSSS5SS5SS 

- - S5 £ !5 S° S S K E S S S " 

Leu Lys Asp Gly Ly s via 12 bO 

1250 ' 1W GTT TCC AAC CAA AGC AGT TAA 

S K 5 S S = = « s s S.- Gln ser ser 



!265 1270 



(2) INFORMATION FOR SEQ ID NO: 4: 

HI SEQUENCE CHARACTERISTICS: 

(Xl ^) LENGTH : 1279 a^ino acids 

( B ) TYPE : amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

« J. 1 - ~ - - - - -S - "5 " 9 

^ « „ n. « » ~ * «3 - " - "* 

Gly Thr V. l ~ - ~J « - "« * *S - W 

sly M9 « p.. -p - - - « - ~ is ** cls s " Bi * 
«. X - ~ - ns - VI1 cl " w » ~ il * " "* 

j: «, ~ « « « - «s - ~ - Thi s - Trp 

„ «. ,u «, - « - * S ~ u * " a ~ 5S ue ~ 
51o », - «. «. «• 5 - «■ » ~ 3 - - "* 
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Lys Leu Leu Gly Asp Phe Thr Gin He Asn Arg Cys Val Glu Glu Leu 
130 " 135 140 

Arg Ser Ser Ser Ala Glu Ala Ser Ala He Thr Phe Gin Asn Leu Val 
145 150 155 160 

Ala He Cys Ala Leu Leu Gly Thr Ser Phe Tyr Tyr Ser Trp Ser Leu 
165 170 175 

Thr Leu He He Leu Cys Ser Ser Pro He He Thr Phe Phe Ala Val 
180 " 185 190 

Val Phe Ser Arg Met He His Val Tyr Ser Glu Lys Glu Asn Ser Glu 
195 200 205 

Thr Ser Lys Ala Ala Gin Leu Leu Thr Trp Ser Met Asn Ala Ala Gin 
210 215 220 

Leu Val Arg Leu Tyr Cys Thr Gin Arg Leu Glu Arg Lys Lys Phe Lys 
225 " 230 235 240 

Glu He He Leu Asn Cys Asn Thr Phe Phe He Lys Ser Cys Phe Phe 
245 250 255 

Val Ala Ala Asn Ala Gly He Leu Arg Phe Leu Thr Leu Thr Met Phe 
260 265 270 

Val Gin Gly Phe Trp Phe Gly Ser Ala Met He Lys Lys Gly Lys Leu 
275 280 285 

Asn He Asn Asp Val He Thr Cys Phe His Ser Cys He Met Leu Gly 
290 ' 295 300 

Ser Thr Leu Asn Asn Thr Leu His Gin He Val Val Leu Gin Lys Gly 
305 310 315 320 

Gly Val Ala Met Glu Lys He Met Thr Leu Leu Lys Asp Gly Ser Lys 
325 330 335 

Arg Asn Pro Leu Asn Lys Thr Val Ala His Gin Phe Pro Leu Asp Tyr 
34 0 345 350 

Ala Thr Ser Asp Leu Thr Phe Ala Asn Val Ser Phe Ser Tyr Pro Ser 
355 360 365 

Arg Pro Ser Glu Ala Val Leu Lys Asn Val Ser Leu Asn Phe Ser Ala 
370 375 380 

Gly Gin Phe Thr Phe He Val Gly Lys Ser Gly Ser Gly Lys Ser Thr 
385 390 395 400 
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„. s « ... - lj. u. «. - ^ - "» - Gly !5 116 
s« „, « «, - «. s »• - - «*• a "* 

420 J *^ 

c _ TT - n Tie Met Pro Gly Thr 
Ota Asn lie Thr Val Val Glu Gin Phe Ser Trp ^ 

435 

u . „. «, « a. I- Pn. «, v.! s.r V Jg «- ^ « 
„, Z «1 u; =*■ «• - «• S5 " "* ~ ~ K 

I" «. «. ».p s « « °» s; Gly Gls Tht s 6 " 

Gly «, «. „ U. «. ». ~ jg >u «, «. - - « ~ 

500 

Thr pro n. ». - - -» S ~ "* ™ " P " 

515 

Br. «, «n ,.u « « jg « »« - ** So - W - " y ^ 
Thr Tr U. - • - - « ~ S ~ S " " » 

M w « K - - - s « - ~ " y s gi - 

s.r «. ... « - «r - - Jg * s " ™ SS ^ 

Mu «, „ C T,r s.r «P JU - - ~ S T " ™ 

595 

G1 „ ^ „s S.r ». »U Jg v.! «. - ». g « - - « 
Th r Z - «» -J *■ - ~ - SB ™ ^ «» 

Z - - - a - «■ "* G1 " w * A " s,t s v * 1 
«, «, «, « ? - - «- a cl " k '° u " Gly «• ** M ° 
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Lys Gin Gin Lys Asn Thr Glu Ser Ser Thr Gly Pro Gin Leu Leu Ser 
675 680 685 

lie lie Gin lie lie Lys Arg Met lie Lys Ser lie Arg Tyr Lys Lys 
690 695 700 

lie Leu lie Leu Gly Leu Leu Cys Ser Leu lie Ala Gly Ala Thr Asn 
705 710 715 720 

Pro Val Phe Ser Tyr Thr Phe Ser Phe Leu Leu Glu Gly lie Val Pro 
725 730 " 735 

Ser Thr Asp Gly Lys Thr Gly Ser Ser His Tyr Leu Ala Lys Trp Ser 
740 745 ~ 750 

Leu Leu Val Leu Gly Val Ala Ala Ala Asp Gly He Phe Asn Phe Ala 
755 760 " " 765 

Lys Gly Phe Leu Leu Asp Cys Cys Ser Glu Tyr Trp Val Met Asp Leu 
770 775 " 780 

Arg Asn Glu Val Met Glu Lys Leu Thr Arg Lys Asn Met Asp Trp Phe 
785 790 " 795 800 

Ser Gly Glu Asn Asn Lys Ala Ser Glu He Ser Ala Leu Val Leu Asn 
805 810 815 

Asp Leu Arg Asp Leu Arg Ser Leu Val Ser Glu Phe Leu Ser Ala Met 
820 825 830 

Thr Ser Phe Val Thr Val Ser Thr He Gly Leu He Trp Ala Leu Val 
835 840 845 

Ser Gly Trp Lys Leu Ser Leu Val Cys He Ser Met Phe Pro Leu He 
850 855 860 

He lie Phe Ser Ala He Tyr Gly Gly He Leu Gin Lys Cys Glu Thr 
865 870 * 875 " " 880 

Asp Tyr Lys Thr Ser Val Ala Gin Leu Glu Asn Cys Leu Tyr Gin He 
885 890 895 

Val Thr Asn He Lys Thr He Lys Cys Leu Gin Ala Glu Phe His Phe 
900 905 910 

Gin Leu Thr Tyr His Asp Leu Lys lie Lys Met Gin Gin He Ala Ser 
915 920 925 

Lys Arg Ala He Ala Thr Gly Phe Gly He Ser Met Thr Asn Met lie 
930 935 940 
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Val Met Cys lie Gin Ala lie lie Tyr Tyr Tyr Gly Leu Lys Leu Val 
945 950 955 960 

Met lie His Glu Tyr Thr Ser Lys Glu Met Phe Thr Thr Phe Thr Leu 
965 970 975 

Leu Leu Phe Thr lie Met Ser Cys Thr Ser Leu Val Ser Gin lie Pro 
980 985 990 

Asp He Ser Arg Gly Gin Arg Ala Ala Ser Trp He Tyr Arg lie Leu 
995 1000 1° 05 

Asp Glu Lys His Asn Thr Leu Glu Val Glu Asn Asn Asn Ala Arg Thr 
1010 1015 1020 

Val Gly He Ala Gly His Thr Tyr His Gly Lys Glu Lys Lys Pro lie 
1025 1030 1035 1040 

Val Ser He Gin Asn Leu Thr Phe Ala Tyr Pro Ser Ala Pro Thr Ala 
1045 1050 1055 



Phe Val Tyr Lys Asn Met Asn Phe Asp Met Phe Cys Gly Gin Thr Leu 
1060 1065 1070 

Gly He He Gly Glu Ser Gly Thr Gly Lys Ser Thr Leu Val Leu Leu 
1075 1080 1085 

Leu Thr Lys Leu Tyr Asn Cys Glu Val Gly Lys He Lys He Asp Gly 
1090 1095 HOO 

Thr Asp Val Asn Asp Trp Asn Leu Thr Ser Leu Arg Lys Glu He Ser 
1105 IHO 1H5 

Val Val Glu Gin Lys Pro Leu Leu Phe Asn Gly Thr He Arg Asp Asn 
1125 H 30 1135 

Leu Thr Tvr Gly Leu Gin Asp Glu He Leu Glu He Glu Met Tyr Asp 
" 1140 H45 H50 

Ala Leu Lys Tyr Val Gly He His Asp Phe Val He Ser Ser Pro Gin 
1155 H60 H65 

Gly Leu Asp Thr Arg He Asp Thr Thr Leu Leu Ser Gly Gly Gin Ala 
1170 H75 1180 

Gin Arg Leu Cys He Ala Arg Ala Leu Leu Arg Lys Ser Lys He Leu 
1185 ~ H90 H95 

He Leu Asp Glu Cys Thr Ser Ala Leu Asp Ser Val Ser Ser Ser lie 

1205 1210 1215 
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lie Asn Glu lie Val Lys Lys Gly Pro Pro Ala Leu Leu Thr Met Val 
1220 1225 1230 

He Thr His Ser Glu Gin Met Met Arg Ser Cys Asn Ser He Ala Val 
1235 1240 1245 

Leu Lys Asp Gly Lys Val Val Glu Arg Gly Asn Phe Asp Thr Leu Tyr 
1250 1255 1260 

Asn Asn Arg Gly Glu Leu Phe Gin He Val Ser Asn Gin Ser Ser 
1265 1270 1275 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic), 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: t # 

(A) ORGANISM: Saccharomyces cerevisiae 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..3786 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 1264.. 1524 

(D) OTHER INFORMATION: /note= "Substituted analogous 
sequence from human CFTR gene"' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG AAC TTT TTA AGT TTT AAG ACT ACA AAA CAC TAT CAC ATT TTC AGG 4 8 

Met Asn Phe Leu Ser Phe Lys Thr Thr Lys His Tyr His lie Phe Arg 

1 5 10 I 5 

TAC GTG AAC ATA CGG AAT GAC TAC AGG CTG TTA ATG ATA ATG ATA ATA 96 

Tyr Val Asn lie Arg Asn Asp Tyr Arg Leu Leu Met He Met He He 
20 25 30 
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GGT ACC GTG GCA CTA TCA GTT TTC GTG GCT AAT GGG TCA CAT CAA GGT 14 4 

Gly Thr Val Ala Leu Ser Val Phe Val Ala Asn Gly Ser His Gin Gly 
35 40 45 

TTG TAT TCC CAA CTA GTA CAG AGG TCA ATG GCA GTA ATG GCA CTT GGT 192 
Leu Tyr Ser Gin Leu Val Gin Arg Ser Met Ala Val Met Ala Leu Gly 
50 55 " 60 

GCG GCT TCT GTG CCA GTA ATG TGG CTT TCT CTA ACA AGT TGG ATG CAC 24 0 

Ala Ala Ser Val Pro Val Met Trp Leu Ser Leu Thr Ser Trp Met His 
65 70 75 80 

ATC GGC GAG AGA CAA GGC TTT AGA ATA CGG TCA CAG ATA TTG GAG GCA 2B8 
lie Gly Glu Arg Gin Gly Phe Arg lie Arg Ser Gin lie Leu Glu Ala 
85 90 95 

TAT TTG GAG GAA AAG CCA ATG GAA TGG TAC GAC AAT AAT GAA AAA TTG 336 
Tyr Leu Glu Glu Lys Pro Met Glu Trp Tyr Asp Asn Asn Glu Lys Leu 
100 105 110 

TTA GGA GAT TTT ACT CAA ATC AAC AGA TGT GTG GAA GAG CTA AGA TCA 384 
Leu Gly Asp Phe Thr Gin lie Asn Arg Cys Val Glu Glu Leu Arg Ser 
115 120 125 

AGC TCC GCA GAG GCA TCA GCC ATA ACT TTC CAG AAT TTA GTT GCA ATA 432 
Ser Ser Ala Glu Ala Ser Ala lie Thr Phe Gin Asn Leu Val Ala lie 
130 135 140 

TGT GCG CTT CTG GGG ACG TCA TTC TAC TAT TCT TGG TCA TTA ACT TTA 4 80 

Cys Ala Leu Leu Gly Thr Ser Phe Tyr Tyr Ser Trp Ser Leu Thr Leu 
145 150 * 155 160 

ATT ATT CTT TGC AGC TCT CCA ATA ATC ACA TTT TTT GCA GTG GTG TTT 528 
He He Leu Cvs Ser Ser Pro He He Thr Phe Phe Ala Val Val Phe 
165 170 175 

TCC AGA ATG ATT CAT GTA TAT TCA GAG AAG GAG AAT TCT GAA ACG AGT 576 
Ser Arg Met He His Val Tyr Ser Glu Lys Glu Asn Ser Glu Thr Ser 
180 " 185 . 190 

AAA GCA GCC CAA TTA CTT ACA TGG TCG ATG AAT GCC GCT CAA TTA GTG 624 
Lys Ala Ala Gin Leu Leu Thr Trp Ser Met Asn Ala Ala Gin Leu Val 
195 200 205 

AGA TTA TAT TGT ACA CAA CGT CTA GAA AGG AAA AAA TTC AAG GAA ATC 672 
Arg Leu Tyr Cys Thr Gin Arg Leu Glu Arg Lys Lys Phe Lys Glu He 
210 215 220 

ATA CTA AAT TGT AAC ACT TTC TTC ATC AAG AGT TGC TTT TTT GTT GCT 720 
He Leu Asn Cvs Asn Thr Phe Phe He Lys Ser Cys Phe Phe Val Ala 
225 230 235 240 
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-. Tr rr*r*r GTA CAG 

s s s s s = a ss = i = « - - s - 

245 , 8 , rG r AAG CTG AAC ATT 

5SSSSSSSS55S. a - 

Asn Asp val He Thr y 280 

275 „™. rii AAA GGC GGA GTG 

« « K S S S - S vll S S S£ S «>• « - 

Leu Asn Asn Tnr ^ 2g5 

290 «i C p AAG CGA AAT 

«f - ffi s s s s s = Si s s* - - « - 

Ala Met biu 

3° 5 ~ rrfc CTA GAT TAT GCC ACC 

rrr CAC CAA TTT CCA x Tnr 

S S K - S ™ S SS P " 335 

355 rrr AAA TCT ACA TTA TCC 

s e s s s s S5 = « s 5 « - - - - 

405 ,„, CCT GGC ACC ATT AAA 

s s s a s a s s s s s s. - s »« - 

Glu Asn He He Pne ^> 44Q 
435 



766 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 
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CTC « „ « « - » - - g s s S s s 

Val He Lys Ala Cys Gin Leu bx 460 

« S s s !nsss:"S s £ - = - i 
« - s* a s 5 = s s s a ss - 5 s s 

Gin Arg Ala Arg Il« Ser x. 490 

„ « - » « E « is S 3S S S S S S S 

Leu Tyr Leu Leu Asp Ser rro ^ 5 iu 

500 _ Am AAA GGA AAG ACT ACA 

£ S £ St £ E £ S S 5 S E - * * 

s s = a ss s a s » s s= = = ™ = 

530 »^ rrr ACC CAG TCT GAA 

„ TTA » J» « |5 « S 5 Si f; Sv T« «. » «» 
Tyr Leu Met Lys Glu Gly «-x 555 

54S ' __ TAT CA c CTA CAG 

S S S 5 1 S S £ s K ~ " 111 »• 
ffi £ S SI S K £ £ I - - ~ S i ffi " 

580 — p »«■ TCT CAA TTG GAA ACA CCA 

$ S S SS S SS K S 52 2S = - S Gl ° ~ "° 
% = S s2 S S = S = s 5 s s S S 

610 n** ACA TCG AAC GTT AGA ACA 

TTG TCC TTT TAC « « « « « » » 2. «» V.! „ «. 
Leu Ser Phe Tyr Glu Ala lie -y ^ 

625 ^ rrr TAT GCA CTA AAA CAA 

s s s ^ ss a k a £ s s? - - - & - 
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CAA AAG AAC ACC GAA AGT TCA ACA GGG CCA CAA CTT CTG AGC ATT ATT 
ITn %s Thr Glu Ser Ser Thr Gly Pro Gin Leu Leu Ser He lie 

660 665 



CAG ATT ATC AAA AGA ATG ATT AAA AGC ATA AGA TAC AAA AAA ATT OTA 
Gin He He Lys Arg Met He Lys Ser He Arg Tyr Lys Lys ixe n 
675 680 685 

ATC TTG GGA CTG CTA TGT TCT CTT ATC GCA GGC GCC ACA AAT CCC GTC 
lie Leu Gly Leu Leu Cys Ser Leu He Ala Gly Ala Thr Asn Pro Val 
690 6" 700 

TTT TCA TAC ACA TTC AGT TTC TTA CTA GAA GGA ATT GTC CCA TCC ACG 
Phe Ser Tyr Thr Phe Ser Phe Leu Leu Glu Gly He Val Pro Se. Thr 
705 710 ~ 15 

GAT GGA AAA ACT GGC TCT TCA CAT TAT TTG GCG AAA TGG TCG CTT CTT 
Asp Gly Lys Thr Gly Ser Ser His Tyr Leu Ala Lys Trp Ser Leu Leu 
725 130 

GTT CTT GGT GTG GCT GCG GCA GAT GGT ATT TTC AAT TTT GOT AAA GGA 
Val Leu Gly Val Ala Ala Ala Asp Gly He Phe Asn Phe Ala Lys Gly 

TTC CTA TTA GAT TGC TGC AGT GAA TAC TGG GTT ATG GAT CTT AGA AAT 
"e Leu £u Asp Cys Cys Ser Glu Tyr Trp Val Met Asp Leu Arg Asn 
755 760 

GAA GTT ATG GAA AAA CTG ACG AGA AAG AAT ATG GAC TGG TTT TCT GGT 
Glu Val Met Glu Lys Leu Thr Arg Lys Asn Met Asp Trp Phe Ser Gl} 
770 775 780 

GAA AAC AAC AAG GCT TCT GAA ATT TCT GCT CTA GTC TTG AAT GAT TTG 
Glu £n £n Ala Ser Glu lie Ser Ala Leu Val Leu Asn Asp Leu 
785 790 795 

CGA GAT TTG AGG TCT TTG GTC TCT GAA TTT TTG AGT GCA ATG ACT AGT 

Ser ~ ' ~ " ' ~ ~ 

805 



CGA GAT TTG AGG TCT TTG GTC TCT uaa tti nv, ™* — 
Arg Asp Leu Arg Ser Leu Val Ser Glu Phe Leu Ser Ala Met Thr Ser 
805 810 81:5 

TTC GTT ACC GTA TCA ACG ATT GGA CTA ATT TGG GCG TTA GTA TCG GGC 
Phe Val Thr Val Ser Thr lie Gly Leu lie Trp Ala Leu Val Ser Gly 
820 825 830 

TGG AAG TTA AGT TTG GTT TGT ATT TCG ATG TTT CCA CTC ATA ATT ATA 
Trp Lys Leu Ser Leu Val Cys lie Ser Met Phe Pro Leu lie He He 
835 84° 84: > 

TTT TCA GCA ATA TAT GGA GGC ATT TTA CAA AAA TGC GAA ACA GAT TAT 
™ Ser Ala lie <ryr Gly Gly lie Leu Gin Lys Cys Glu Thr Asp Tyr 
850 e 55 860 
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AAA CTT TAT AAT TGT GAA GTA GGC AAA ATT AAA ATA GAC GGT ACG GAC 3264 
Lys Leu Tyr Asn Cys Glu Val Gly Lys He Lys lie Asp Gly Thr Asp 
1075 " 1080 1085 

GTA AAT GAC TGG AAT TTG AC A AGT TTA AGA AAA GAA ATT TCA GTG GTT 3312 
Val Asn Asp Trp Asn Leu Thr Ser Leu Arg Lys Glu He Ser Val Val 
1090 1095 1100 

GAG CAA AAA CCT TTA TTA TTC AAT GGA ACC ATC AGA GAT AAC CTA ACT 3360 
Glu Gin Lys Pro Leu Leu Phe Asn Gly Thr He Arg Asp Asn Leu Thr 
1105 1110 H15 ll 20 

TAT GGT TTA CAA GAT GAA ATA CTT GAA ATT GAA ATG TAT GAT GCA TTA 34 08 

Tyr Gly Leu Gin Asp Glu He Leu Glu He Glu Met Tyr Asp Ala Leu 
J 1125 H30 H35 

AAA TAC GTA GGA ATC CAT GAC TTT GTA ATT TCA TCA CCT CAG GGC TTG 3456 
Lys Tyr Val Gly He His Asp Phe Val He Ser Ser Pro Gin Gly Leu 
1 1140 H45 H50 

GAT ACA CGT ATT GAT ACA ACT TTA CTG TCA GGT GGA CAA GCG CAA AGG 3504 
Asp Thr Arg He Asp Thr Thr Leu Leu Ser Gly Gly Gin Ala Gin Arg 
1155 * H60 11^5 

CTT TGC ATA GCC AGA GCA CTT CTG AGA AAA TCA AAA ATT CTG ATT TTA 3552 
Leu Cys lie Ala Arg Ala Leu Leu Arg Lys Ser Lys He Leu He Leu 
1170 H75 H80 

GAT GAG TGT ACT TCA GCC TTG GAT TCT GTC AGC TCC TCT ATC ATC AAT 3600 
Asp Glu Cvs Thr Ser Ala Leu Asp Ser Val Ser Ser Ser He He Asn 
1185 H90 H95 1ZO0 

GAG ATC GTC AAA AAA GGT CCA CCT GCT CTA CTA ACA ATG GTT ATA ACG 364 8 

Glu He Val Lys Lys Gly Pro Pro Ala Leu Leu Thr Met Val lie Thr 
1205 1210 1215 

CAT AGT GAA CAA ATG ATG AGG TCT TGT AAC TCG ATT GCA GTT CTT AAA 3696 
His Ser Glu Gin Met Met Arg Ser Cys Asn Ser He Ala Val Leu Lys 
1220 1225 1230 

GAT GGT AAA GTG GTT GAG CGA GGT AAC TTC GAC ACT TTA TAT AAT AAT 3744 
Asp Gly Lys Val Val Glu Arg Gly Asn Phe Asp Thr Leu Tyr Asn Asn 
1235 1240 1245 

CGC GGG GAA TTA TTC CAA ATT GTT TCC AAC CAA AGC AGT TAA 37 86 

Arq Gly Glu Leu Phe Gin He Val Ser Asn Gin Ser Ser 
1250 1255 1260 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1261 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Phe Leu Ser Phe Lys Thr Thr Lys His Tyr His He Phe Arc, 
1 5 

Tyr Val Asn lie Arg Asn Asp. Tyr Arg Leu Leu Met He Met lie He 

20 25 
Gly Thr Val Ala Leu Ser Val Phe Val Ala Asn Gly Ser His Gin Gly 



35 



Leu Tyr Ser Gin Leu Val Gin Arg Ser Met Ala Val Met Ala Leu Gly 

50 55 60 

Ala Ala Ser Val Pro Val Met Trp Leu Ser Leu Thr Ser Trp Met His 
65 7° 75 



lie Gly Glu Arg Gin Gly Phe Arg lie Arg Ser Gin lie Leu Glu Ala 
85 

Tyr Leu Glu Glu Lys Pro Met Glu Trp Tyr Asp Asn Asn Glu Lys Leu 
100 105 

Phe 

115 120 



Ser Ser 



Ala Glu Ala Ser Ala lie Thr Phe Gin Asn Leu Val Ala He 



Leu Gly Asp Phe Thr Gin He Asn Arg Cys Val Glu Glu Leu Arg Ser 

130 135 I 40 

Cys Ala Leu Leu Gly Thr Ser Phe Tyr Tyr Ser Trp Ser Leu Thr Leu 
145 150 

He lie Leu Cys Ser Ser Pro lie lie Thr Phe Phe Ala Val Val Phe 

165 170 
Ser Arg Met lie His Val Tyr Ser Glu Lys Glu Asn Ser Glu Thr Ser 



ieo 



Lys Ala Ala Gin Leu Leu Thr Trp Ser Met Asn Ala Ala Gin Leu Val 
J !95 200 

Arg Leu Tyr Cys Thr Gin Arg Leu Glu Arg Lys Lys Phe Lys Glu He 

215 ZZKJ 



210 



He Leu Asn Cys Asn Thr Phe Phe lie Lys Ser Cys Phe Phe Val Ala 
225 230 2 -*5 
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Ala Asn Ala Gly lie Leu Arg Phe Leu Thr Leu Thr Met Phe Val Gin 
245 250 255 

Gly Phe Trp Phe Gly Ser Ala Met He Lys Lys Gly Lys Leu Asn He 

260 265 270 

Asn Asp Val He Thr Cys Phe His Ser Cys He Met Leu Gly Ser Thr 
275 280 285 

Leu Asn Asn Thr Leu His Gin lie Val Val Leu Gin Lys Gly Gly Val 
290 295 300 

Ala Met Glu Lys He Met Thr Leu Leu Lys Asp Gly Ser Lys Arg Asn 



305 



310 



315 



Pro Leu Asn Lys Thr Val Ala His Gin Phe Pro Leu Asp Tyr Ala Thr 
325 330 335 

Ser Asp Leu Thr Phe Ala Asn Val Ser Phe Ser Tyr Pro Ser Arg Pro 



340 



345 



350 



Ser Glu Ala Val Leu Lys Asn Val Ser Leu Asn Phe Ser Ala Gly Gin 

36 0 365 



355 

Phe Thr Phe He Val Gly Lys Ser Gly Ser Gly Lys Ser Thr Leu Ser 
370 375 380 

Asn Leu Leu Leu Arg Phe Tyr Asp Gly Tyr Asn Gly Ser lie Ser lie 
365 390 395 

Asn Gly His Asn He Gin Thr lie Asp Gin Lys Leu Leu He Glu Asn 
405 410 415 

He Thr Val Val Glu Gin Phe Ser Trp He Met Pro Gly Thr He Lys 
420 425 430 

Glu Asn He He Phe Gly Val Ser Tyr Asp Glu Tyr Arg Tyr Arg Ser 
435 440 '445 

Val He Lys Ala Cvs Gin Leu Glu Glu Asp He Ser Lys Phe Ala Glu 

455 460 



450 



Lys Asp Asn He Val Leu Gly Glu Gly Gly He Thr Leu Ser Gly Gly 
465 470 475 

Gin Arg Ala Arg He Ser Leu Ala Arg Ala Val Tyr Lys Asp Ala Asp 

4 85 4 90 4 93 



Leu Tyr Leu Leu Asp Ser Pro Phe Gly Tyr Leu Asp He Val His Arg 



500 



505 
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Asn Leu Leu Met Lys Ala He Arg His Trp Arg Lys Gly Lys Thr Thr 

515 520 
He lie Leu Thr Bis Glu Leu Ser Gin lie Glu Ser Asp Asp Tyr Leu 

530 535 
Ty r Leu Met Lys Glu Gly Glu Val Val Glu Ser Gly Thr Gin Ser Glu 
545 550 

X,e U Leu Ala Asp Pro Thr Thr Thr Phe Ser Thr Trp Tyr His- Leu Gin 
565 

Asn Asp Tyr Ser Asp Ala Lys Thr lie Val Asp Thr Glu Thr Glu Glu 

580 5Bb 
Lys ser lie His Thr Val Glu Ser Phe Acn Ser Gin Leu Glu Thr Pro 

595 600 
Lys Leu Gly Ser Cys Le'u Ser Asn Leu Gly Tyr Asp Glu Thr Asp Gin 

610 615 
Le u Ser Phe Tyr Glu Ala lie Tyr Gin Lys Arg Ser Asn Val Arg Thr 
625 630 

AT* Arg Val Lys Val Glu Glu Glu Asn II. Gly Tyr Ala Leu £. Gin 

Gin Lys Asn Thr Glu Ser Ser Thr Gly Pro Gin Leu Leu Ser He He 
660 665 

Gin Xle He Lys Arg Met lie Lys Ser He Arg Tyr Lys Lys He Leu 

675 680 
He Leu Gly Leu Leu Cys Ser Leu He Ala Gly Ala Thr Asn Pro Val 

690 695 
Phe Ser Tyr Thr Phe Ser Phe Leu Leu Glu Gly He Val Pro Ser Thr 
705 710 

Asp Gly Lys Thr Gly Ser Ser His Tyr Leu Ala Lys Trp Ser Leu Leu 

Val Leu Gly Val Ala Ala Ala Asp Gly He Phe Asn Phe Ala Lys Gly 

Phe Leu Leu Asp Cys Cys Ser Glu Tyr Trp Val Met Asp Leu Arg Asn 

755 ' 60 
Glu val Met Glu Lys Leu Thr Arg Lys Asn Met Asp Trp Phe Ser Gly 
770 775 
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Glu Asn Asn Lys Ala Ser Glu lie Ser Ala Leu Val Leu Asn Asp Leu 
785 790 795 800 

Arq Asp Leu Arq Ser Leu Val Ser Glu Phe Leu Ser Ala Met Thr Ser 
y 805 810 815 

Phe Val Thr Val Ser Thr lie Gly Leu He Trp Ala Leu Val Ser Gly 
820 825 830 

Trp Lys Leu Ser Leu Val Cys He Ser Met Phe Pro Leu He He He 
835 840 845 

Phe Ser Ala He Tyr Gly Gly He Leu Gin Lys Cys Glu Thr Asp Tyr 
850 855 860 

Lys Thr Ser Val Ala Gin Leu Glu Asn Cys Leu Tyr Gin He Val Thr 
865 870 875 880 

Asn He Lys Thr He Lys Cys Leu Gin Ala Glu Phe His Phe Gin Leu 
885 890 895 

Thr Tyr His Asp Leu Lys He Lys Met Gin Gin He Ala Ser Lys Arg 
9 00 905 910 

Ala He Ala Thr Gly Phe Gly He Ser Met Thr Asn Met He Val Met 
915 - 920 925 

Cys He Gin Ala He He Tyr Tyr Tyr Gly Leu Lys Leu Val Met He. 
930 935 940 

His Glu Tyr Thr Ser Lys Glu Met Phe Thr Thr Phe Thr Leu Leu Leu 
945 " 950 955 960 

Phe Thr He Met Ser Cys Thr Ser Leu Val Ser Gin He Pro Asp He 
965 970 975 

Ser Arg Gly Gin Arg Ala Ala Ser Trp He Tyr Arg He Leu Asp Glu 
980 985 990 

Lys His Asn Thr Leu Glu Val Glu Asn Asn Asn Ala Arg Thr Val Gly 
995 1000 1005 

He Ala Gly His Thr Tyr His Gly Lys Glu Lys Lys Pro He Val Ser 
1010 1015 1020 

He Gin Asn Leu Thr Phe Ala Tyr Pro Ser Ala Pro Thr Ala Phe Val 
1025 1030 1035 1040 

*vr Lys Asn Met Asn Phe Asp Met Phe Cys Gly Gin Thr Leu Gly He 
1045 1050 1055 
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He Gly Glu Ser Gly Thr Gly Lys Ser Thr Leu Val Leu Leu Leu Thr 
1060 1065 1070 

Lys Leu Tyr Asn Cys Glu Val Gly Lys He Lys He Asp Gly Thr Asp 
1075 1080 1085 

Val Asn Asp Trp Asn Leu Thr Ser Leu Arg Lys Glu He Ser Val Val 
1090 1095 1100 

Glu Gin Lys Pro Leu Leu Phe Asn Gly Thr He Arg Asp Asn Leu Thr 
1105 mo 1115 H2< 

Tyr Gly Leu Gin Asp Glu He Leu Glu He Glu Met Tyr Asp Ala Leu 
1125 1130 H35 

Lys Tyr Val Gly He His Asp Phe Val He Ser Ser Pro Gin Gly Leu 
1140 1145 1150 

Asp Thr Arg He Asp Thr Thr Leu Leu Ser Gly Gly Gin Ala Gin Arg 
1155 1160 H65 

Leu Cys He Ala Arg Ala Leu Leu Arg Lys Ser Lys He Leu He Leu 
1170 H75 1180 

Asp Glu Cys Thr Ser Ala Leu Asp Ser Val Ser Ser Ser He He Asn 
H85 1190 1195 120C 

Glu He Val Lys Lys Gly Pro Pro Ala Leu Leu Thr Met Val He Thr 



Asp Gly Lys Val Val Glu Arg Gly Asn Phe Asp Thr Leu Tyr Asn Asn 
1235 1240 1245 

Arg Gly Glu Leu Phe Gin He Val Ser Asn Gin Ser Ser 



12 05 



1210 



1215 



His Ser Glu Gin Met Met Arg Ser 
1220 



Cys Asn Ser He Ala Val Leu Lys 
1225 1230 



1250 



1255 



1260 • 
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What is claimed is: 



X A method of identify!-, potential treatments 
which correct mislocalization and/or misfunctron of 
CFTR resulting from mutations said method 
comprising: forming a chimeric gene 
substituting a portion of a gene 
encodes the first nucleotide binding domain of 
cystic fibrosis conductance regulator wrth a qene 
sequence which encodes STE6; said CFTR sequence 
being a CFTR gene sequence encoding a plurality 

. „ 444 to and including 

amino acids from ammo acid 444 to a 

amino acid 577, such that introduction of a 
station in the CFTR region inhibits express, of 
the chimeric STE6 gene producti transforming said 
chimeric gene seguence into a yeast 
functional STE6 gene; introducing said potential 
treatment to said yeast strain; and assaying for 
the production of diploid yeast colonies following 
yeast cell mating. 

2 . The method of claim 1 wherein said mutation is 
. deletion of phenylalanine at position SOB of the 
CFTR gene product. 

3 The method of claim 1 wherein said CFTR 
substitute is the qene sequence which encodes from 
amino acid 494 through 546 of the CFTR gene product 
(SEQ ID #1). 

T u e m „thod of claim 1 wherein said CFTR 

\ is the qene sequence which encodes from 
substitution is the gene s 4 

amino acid 494 up to and including amino acid 
the CFTR gene product (SEQ ID #3). 
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5. The method of claim 1 wherein said CFTR 
substitution is the gene sequence which encodes from 
amino acid 494 up to and including 577 of the CFTR 
gene product (SEQ ID #5). 

6. The method of claim 3, 4 , or 5 wherein said 
CFTR substitution includes a mutation which causes 
cystic fibrosis. 

7. A chimeric protein comprising a yeast STE6 
protein product having substituted therein an 
analogous portion of CFTR nucleotide binding domain 
including a plurality of amino acids from amino acid 
443 to amino acid 577. 

8. The protein of claim 7 wherein said CFTR 
sequence includes a mutation which results in 
cystic fibrosis. 

9. The protein of claim 7 wherein said amino 
acid sequence is SEQ ID # 2. 

10. The protein of claim 7 wherein said amino 
acid sequence is SEQ ID # 4. 

11. The protein of claim 7 wherein said amino 
acid sequence is SEQ ID # 6. 

12. A chimeric protein sufficiently homologous to 
that of claim 7 such that introduction of a 
mutation in the CFTR portion will prevent 
transport of a-factor out a yeast cell thereby 
preventing yeast mating. 
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13. A DNA sequence which encodes the protein of 
claim 7. 

14. A host cell transformed with the DNA sequence 
of claim 13 in a manner allowing expression of the 
STE6 gene . 

15. A method of screening for potential inhibitors of 
ABC transporters by the use of STE6 chimeras, said 
method comprising selecting from the class of related 
proteins consisting essentially of multi-drug 
resistance (mdr) or P-glycoprotein, the choroquine 
resistance transporter (pfmdr) of Plasmodium falciparum, 
and CFTR; substituting the nucleotide sequence of the 
STE6 gene with the sequence encoding the analogous 
portion of said ABC transporter; transforming said 
chimeric gene sequence into a yeast which is deleted for 
the STE6 gene; introducing a treatment to said 
transformed yeast strain; and assaying for the production 
of yeast diploid colonies following yeast cell mating. 
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SELECTIVE MEDIA PLATE WITH DIPLOID CDLDN1ES 
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H4 Chimera (Seq. ID No. l) 



ATG AAC TTT TTA 
Met Asn Phe Leu 
1 


AGT TTT AAG 
Ser Phe Lys 
5 


ACT 
Thr 


ACA 
Thr 


AAA 
Lvs 
10 


CAC TAT 
His Tyr 


CAC 
His 


ATT 
He 


TTC 
Phe 
15 


AGG 
Arg 


TAC GTG AAC ATA 
Tvt* Va 1 Asn Tie 

x j l. vox noii -i- ^ 

20 


CGG AAT GAC 
Atct Asn Asn 


TAC 
Tvt 


AGG 
Arcr 

*vx y 

25 


CTG 
Leu 


TTA 
Leu 


ATG 
Met 


ATA 
He 


ATG. 
Met 
30 


ATA 
He 


ATA 
He 


GGT ACC GTG GCA 
G±y inr vai Aia 
35 


ACA GGC CTA 
inr c»±y L*eu 


GTG 

\7a 1 

vax 
40 


CCG 


GCA 

Ala 
Aid 


ATT 
He 


ACT 
Thr 


TCT 
Ser 
45 


ATC 
He 


CTG 
Xjeu 


ACG 

xllx 


GGC AGA GTG TTC 

/**1ir TV >-rr Val "DVio 

i»±y Arg vai -rue 
50 


GAT CTA CTA 

ASp iicu bcu 

55 


TCA 

Cor 

Dei 


GTT 
vai 


TTC 

Oho 

rue 


GTG 
Val 


GCT 
Ala 
60 


AAT GGG 
Asn Gly 


TCA 

Car 


CAT 
nib 


CAA GGT TTG TAT 
filn fZlv T.*»\i Tvt 

65 


TCC CAA CTA 

OCX Xi 1 i>CU 

70 


GTA 
Val 


CAG 
Gin 


AGG 

Axy 


TCA 
Ser 
75 


ATG 
Met 


GCA 

Ala 


GTA 
Val 


ATG 

ric u. 


GCA 
Ala 
80 


CTT GGT GCG GCT 
Leu Gly Ala Ala 


TCT GTG CCA 
Ser Val Pro 
85 


GTA 
Val 


ATG 
Met 


TGG 
Trp 
90 


CTT 
Leu 


TCT 
Ser 


CTA 
Leu 


ACA 
Thr 


AGT 
Ser 
95 


TGG 
Trp 


ATG CAC ATC GGC 
Met His lie Gly 
100 


GAG AGA CAA 

Glu Arg Gin 


GGC 

Gly 


TTT 
Phe 
105 


AGA 
Arg 


ATA CGG 
He Arg 


TCA 
Ser 


CAG 
Gin 
110 


ATA 

He 


TTG 
Leu 


GAG GCA TAT TTG 
Glu Ala Tyr Leu 
115 


GAG GAA AAG 
Glu Glu Lys 


CCA 

Pro 

120 


ATG 
Met 


GAA 
Glu 


TGG TAC 

Trp Tyr 


GAC AAT 
Asp Asn 
125 


AAT 
Asn 


GAA 
Glu 


AAA TTG TTA GGA 
Lys Leu Leu Gly 
130 


GAT TTT ACT 
Asp Phe Thr 
135 


CAA 
Gin 


ATC 
He 


AAC 
Asn 


AGA TGT 
Arg Cys 
140 


GTG 
Val 


GAA 
Glu 


GAG 
Glu 


CTA 
Leu 


AGA TCA AGC TCC 
Arg Ser Ser Ser 
145 


GCA GAG GCA 
Ala Glu. Ala 
150 


TCA 
Ser 


GCC 
Ala 


ATA 
He 


ACT 
Thr 
155 


TTC 
Phe 


CAG 
Gin 


AAT 
Asn 


TTA 
Leu 


GTT 
Val 
160 


GCA ATA TGT GCG 
Ala lie Cys Ala 


CTT CTG GGG 
Leu Leu Gly 
165 


ACG 
Thr 


TCA 
Ser 


TTC 
Phe 
170 


TAC TAT 
Tyr Tyr 


TCT TGG 
Ser Trp 


TCA 
Ser 
175 


TTA 
Leu 


ACT TTA ATT ATT 
Thr Leu lie lie 
180 


CTT TGC AGC 
Leu Cys Ser 


TCT 
Ser 


CCA 
Pro 
185 


ATA 
He 


ATC 
He 


ACA 
Thr 


TTT 
Phe 


TTT 
Phe 
190 


GCA 
Ala 


GTG 
Val 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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GTG TTT TCC AGA ATG ATT CAT GTA TAT TCA GAG AAG GAG AAT TCT GAA 624 
Val Phe Ser Arg Met lie His Val Tyr Ser Glu Lys Glu Asn Ser Glu 
195 200 205 

ACG AGT AAA GCA GCC CAA TTA CTT ACA TGG TCG ATG AAT GCC GCT CAA 672 
Thr Ser Lys Ala Ala Gin Leu. Leu Thr Trp Ser Met Asn Ala Ala Gin 

210 215 220 

TTA GTG AGA TTA TAT TGT ACA CAA CGT CTA GAA AGG AAA AAA TTC AAG 720 
Leu Val Arg Leu Tyr Cys Thr Gin Arg Leu Glu Arg Lys Lys Phe Lys 

225 230 235 240 

GAA ATC ATA CTA AAT TGT AAC ACT TTC TTC ATC AAG AGT TGC TTT TTT 768 
Glu lie He Leu Asn Cys Asn Thr Phe Phe He Lys Ser Cys Phe Phe 
245 250 255 

GTT GCT GCA AAC GCT GGG ATC TTG AGA TTT TTG ACG TTG ACT ATG TTT 816 
Val Ala Ala Asn Ala Gly He Leu Arg Phe Leu Thr Leu Thr Met Phe 

260 265 270 

GTA CAG GGA TTC TGG TTT GGT TCC GCA ATG ATC AAA AAG GGC AAG CTG 864 
Val Gin Gly Phe Trp Phe Gly Ser Ala Met He Lys Lys Gly Lys Leu 
275 280 285 

AAC ATT AAC GAT GTA ATC ACT TGC TTC CAT TCA TGC ATT ATG CTG GGC 912 
Asn He Asn Asp Val He Thr Cys Phe His Ser Cys He Met Leu Gly 

290 295 300 

TCG ACT TTA AAT AAT ACA TTA CAC CAA ATA GTT GTT CTT CAA AAA GGC 960 
Ser Thr Leu Asn Asn Thr Leu His Gin He Val Val Leu Gin Lys Gly 

305 310 315 320 

GGA GTG GCT ATG GAA AAA ATC ATG ACT CTA TTA AAA GAT GGA TCC AAG 1008 
Gly Val Ala Met Glu Lys He Met Thr Leu Leu Lys Asp Gly Ser Lys 

325 330 335 

CGA AAT CCT TTA AAT AAA ACT GTA GCC CAC CAA TTT CCA CTA GAT TAT 1056 
Arg Asn Pro Leu Asn Lys Thr Val Ala His Gin Phe Pro Leu Asp Tyr 

340 345 350 

GCC ACC AGT GAT CTA ACA TTT GCT AAT GTT TCG TTT TCT TAT CCA AGC 1104 
Ala Thr Ser Asp Leu Thr Phe Ala Asn Val Ser Phe Ser Tyr Pro Ser 

355 360 365 

AGA CCT TCG GAA GCA GTT TTA AAG AAC GTT AGT TTA AAT TTC TCT GCA 1152 
Arg Pro Ser Glu Ala Val Leu Lys Asn Val Ser Leu Asn Phe Ser Ala 

370 375 380 

GGA CAA TTT ACT TTC ATA GTA GGA AAA TCA GGC TCA GGT AAA TCT ACA 1200 
Gly Gin Phe Thr Phe He Val Gly Lys Ser Gly Ser Gly Lys Ser Thr 
385 390 395 400 
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TTA TCC AAC TTA TTA TTA AGG TTC TAC GAT GGC TAT AAT GGA TCG ATA 
22 sir A^n Leu Leu Leu Arg Phe Tyr Asp Gly Tyr Asn Gly Ser He 

405 410 

TCT ATC AAT GGC CAC AAT ATC CAA ACA ATC GAC CAA AAA TTG CTA ATT 
SS III iin Sy His Asn lie Gin Thr lie Asp Gin Lys Leu Leu lie 
420 425 



GAA AAT ATC 
Glu Asn lie 

435 

ATT AAA GAA 
lie Lys Glu 
450 

AGA AGC GTC 
Arg Ser Val 
465 

GCA GAG AAA 
Ala Glu Lys 



GGC GGG CAA 
Gly Gly Gin 



ACC GTC GTA GAA 
Thr Val Val Glu 

STE6 



CAG TTT TCC TGG ATT ATG CCT GGC ACC 
Gin Phe ser Trp He Met Pro Gly Thr 

440 445 
Zl CFTR 

AAT ATC ATC TTT GGT GTT TCC TAT GAT GAA TAT AGA TAC 
Asn He He Phe Gly Val Ser Tyr Asp Glu Tyr Arg Tyr 

455 460 

ATC AAA GCA TGC CAA CTA GAA GAG GAC ATC TCC AAG TTT 
lie Lys Ala Cys Gin Leu Glu Glu Asp He Ser Lys Phe 

GAC AAT ATA GTT CTT GGA GAA GGT GGA ATC ACA CTG AGT1 
Asp Asn He Val Leu Gly Glu Gly Gly He Thr Leu Ser 

485 490 495^ ± 

CAA CAA AGA GTT GCT ATA GCA CGT GCA TTC ATC AGA GAT 
Gin Gin Arg Val Ala He Ala Arg Ala Phe He Arg Asp 

500 505 510 



ACT CCA ATA TTA TTC TTA GAC GAA GCT GTA TCG GCT CTA GAT ATT GTT 
JS Pr^ He Leu Phe Leu Asp Glu Ala Val Ser Ala Leu Asp He Val 

515 520 525 

CAT CGC AAC CTG TTG ATG AAG GCA ATT AGG CAT TGG AGG AAA GGA AAG 
Ss Sg ^n Leu £eu Met Lys Ala He Arg His Trp Arg Lys Gly Lys 
530 535 540 

ACT ACA ATC ATA TTG ACG CAT GAG TTG AGC CAA ATT GAA TCT GAT GAC 
JS JS He S Leu Thr His Glu Leu Ser Gin He Glu Ser Asp Asp 

545 550 555 

TAT TTA TAT TTA ATG AAG GAA GGT GAA GTT GTT GAA AGC GGC ACC CAG 
Tyr Leu Tyr Leu Met Lys Glu Gly Glu Val Val Glu Ser Gly Thr Gin 

565 570 575 

TCT GAA CTT CTA GCC GAT CCG ACC ACT ACA TTT AGC ACA TGG TAT CAC 
Ser Glu Leu Leu Ala Asp Pro Thr Thr Thr Phe Ser Thr Trp Tyr His 
580 585 590 

CTA CAG AAT GAC TAC TCT GAT GCG AAA ACT ATT GTA GAT ACA GAG ACT 
££ j£n tsp Tyr Ser Asp Ala Lys Thr He Val Asp Thr Glu Thr 

595 600 605 



1248 



1296 



1344 



1392 



1440 



1488 



STE6 



1536 



1584 



1632 



1680 



1728 



1776 



1824 
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GAA GAA AAA TCT ATA CAC ACT GTG GAA AGT TTT AAC TCT CAA TTG GAA 1872 

Glu Glu Lys Ser lie His Thr Val Glu Ser Phe Asn Ser Gin Leu Glu 

610 615 620 

ACA CCA AAA CTT GGA TCA TGC TTA AGT AAT CTG GGA TAT GAT GAG ACA 1920 

Thr Pro Lys Leu Gly Ser Cys Leu Ser Asn Leu Gly Tyr Asp Glu Thr 

625 " 630 635 640 

GAT CAG TTG TCC TTT TAC GAA GCA ATC TAT CAA AAA AGA TCG AAC GTT 1968 

Asp Gin Leu Ser Phe Tyr Glu Ala lie Tyr Gin Lys Arg Ser Asn Val 

645 650 655 

AGA ACA AGA AGG GTT AAA GTT GAA GAG GAA AAT ATT GGG TAT GCA CTA 2016 

Arg Thr Arg Arg Val Lys Val Glu Glu Glu Asn lie Gly Tyr Ala Leu 

660 * 665 670 

AAA CAA CAA AAG AAC ACC GAA AGT TCA ACA GGG CCA CAA CTT CTG AGC 2064 

Lys Gin Gin Lys Asn Thr Glu Ser Ser Thr Gly Pro Gin Leu Leu Ser 

675 680 685 

ATT ATT CAG ATT ATC AAA AGA ATG ATT AAA AGC ATA AGA TAC AAA AAA 2112 

He He Gin He He Lys Arg Met He Lys Ser He Arg Tyr Lys Lys 

690 695 700 

ATT CTA ATC TTG GGA CTG CTA TGT TCT CTT ATC GCA GGC GCC ACA AAT 2160 

He Leu He Leu Gly Leu Leu Cys Ser Leu He Ala Gly Ala Thr Asn 

705 710 715 720 

CCC GTC TTT TCA TAC ACA TTC AGT TTC TTA CTA GAA GGA ATT GTC CCA 2208 

Pro Val Phe Ser Tyr Thr Phe Ser Phe Leu Leu Glu Gly He Val Pro 

725 730 735 

TCC ACG GAT GGA AAA ACT GGC TCT TCA CAT TAT TTG GCG AAA TGG TCG 2256 

Ser Thr Asp Gly Lys Thr Gly Ser Ser His Tyr Leu Ala Lys Trp Ser 

740 745 750 

CTT CTT GTT CTT GGT GTG GCT GCG GCA GAT GGT ATT TTC AAT TTT GCT 2304 

Leu Leu Val Leu Gly Val Ala Ala Ala Asp Gly He Phe Asn Phe Ala 

755 760 765 

AAA GGA TTC CTA TTA GAT TGC TGC AGT GAA TAC TGG GTT ATG GAT CTT 2352 

Lys Gly Phe Leu Leu Asp Cys Cys Ser Glu Tyr Trp Val Met Asp Leu 

770 775 780 

AGA AAT GAA GTT ATG GAA AAA CTG ACG AGA AAG AAT ATG GAC TGG TTT 2400 

Arg Asn Glu Val Met Glu Lys Leu Thr Arg Lys Asn Met Asp Trp Phe 

785 790 795 800 

TCT GGT GAA AAC AAC AAG GCT TCT GAA ATT TCT GCT CTA GTC TTG AAT 2448 

Ser Gly Glu Asn Asn Lys Ala Ser Glu He Ser Ala Leu Val Leu Asn 

805 810 815 
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GAT TTG CGA GAT TTG AGG TCT TTG GTC TCT GAA TTT TTG AGT GCA ATG 2496 
Asp Leu Arg Asp Leu Arg Ser Leu Val Ser Glu Phe Leu Ser Ala Met 
820 825 830 

ACT AGT TTC GTT ACC GTA TCA ACG ATT GGA CTA ATT TGG GCG TTA GTA 2544 
Thr Ser Phe Val Thr Val Ser Thr He Gly Leu He Trp Ala Leu Val 
835 840 845 

TCG GGC TGG AAG TTA AGT TTG GTT TGT ATT TCG ATG TTT CCA CTC ATA 2592 
Ser Gly Trp Lys Leu Ser Leu Val Cys He Ser Met Phe Pro Leu He 

850 855 860 

ATT ATA TTT TCA GCA ATA TAT GGA GGC ATT TTA CAA AAA TGC GAA ACA 2640 
He lie Phe Ser Ala He Tyr Gly Gly He Leu Gin Lys Cys Glu Thr 

865 870 875 880 

GAT TAT AAG ACA TCT GTT GCT CAG TTA GAA AAC TGC CTG TAC CAG ATT 2688 
Asp Tyr Lys Thr Ser Val Ala Gin Leu Glu Asn Cys Leu Tyr Gin He 
885 890 895 

GTC ACT AAC ATT AAA- ACC ATT AAG TGC TTA CAA GCT GAA TTT CAT TTT 2736 
Val Thr Asn He Lys Thr He Lys Cys Leu Gin Ala Glu Phe His Phe 
900 905 910 

CAA TTG ACC TAC CAT GAC TTG AAG ATA AAA ATG CAA CAA ATT GCC TCC 2784 
Gin Leu Thr Tyr His Asp Leu Lys He Lys Met Gin Gin He Ala Ser 
915 920 925 

AAA CGC GCC ATT GCC ACA GGA TTT GGT ATA TCT ATG ACA AAC ATG ATT 2832 
Lys Arg Ala He Ala Thr Gly Phe Gly He Ser Met Thr Asn Met He 
930 935 940 

GTC ATG TGT ATC CAA GCT ATT ATT TAC TAC TAT GGC CTA AAG CTG GTT 2880 
Val Met Cys He Gin Ala He He Tyr Tyr Tyr Gly Leu Lys Leu Val 
945 " 950 955 960 

ATG ATT CAC GAG TAC ACC TCA AAG GAA ATG TTT ACG ACT TTC ACT TTG 2928 
Met He His Glu Tyr Thr Ser Lys Glu Met Phe Thr Thr Phe Thr Leu 
965 970 975 

TTA TTA TTC ACT ATT ATG TCA TGC ACT AGC CTA GTA AGT CAG ATA CCC 297 6 

Leu Leu Phe Thr He Met Ser Cys Thr Ser Leu Val Ser Gin He Pro 

980 985 990 

GAT ATA AGT AGA GGC CAA CGT GCT GCC AGT TGG ATC TAT AGG ATT CTT 3 024 

Asp He Ser Arg Gly Gin Arg Ala Ala Ser Trp He Tyr Arg He Leu 
S95 1000 1005 

GAT GAA AAG CAT AAT ACC CTA GAG GTT GAA AAC AAT AAT GCT AGA ACA 3072 
Asp Glu Lys His Asn Thr Leu Glu Val Glu Asn Asn Asn Ala Arg Thr 
1010 1015 1020 



FIG.4E 



BNSDOCID: <WO 9425607A1J_> 



WO 94/25607 



PCT/US94/04379 



8/23 



GTG GGA ATA GCT GGT CAC ACC TAC CAT GGC AAA GAA AAA AAA CCA ATC 3120 
Val Gly lie Ala Gly His Thr Tyr His Gly Lys Glu Lys Lys Pro He 
1025 1030 1035 1040 

GTT TCA ATT CAA AAT TTG ACA TTT GCC TAT CCA TCT GCA CCT ACC GCC 3168 
Val Ser He Gin Asn Leu Thr Phe Ala Tyr Pro Ser Ala Pro Thr Ala 
1045 1050 1055 

TTT GTT TAC AAA AAC ATG AAT TTT GAC ATG TTT TGC GGA CAG ACG TTA 3216 
Phe Val Tyr Lys Asn Met Asn Phe Asp Met Phe Cys Gly Gin Thr Leu 
1060 1065 1070 

GGT ATC ATT GGT GAA TCA GGC ACA GGA AAG TCT ACA CTT GTG CTT TTA 3264 
Gly He He Gly Glu Ser Gly Thr Gly Lys Ser Thr Leu Val Leu Leu 
1075 1080 1085 

TTA ACA AAA CTT TAT AAT TGT GAA GTA GGC AAA ATT AAA ATA GAC GGT 3 312 

Leu Thr Lys Leu Tyr Asn Cys Glu Val Gly Lys He Lys He Asp Gly 
1090 1095 HOO 

ACG GAC GTA AAT GAC TGG AAT TTG ACA AGT TTA AGA AAA GAA ATT TCA 3360 
Thr Asp Val Asn Asp Trp Asn Leu Thr Ser Leu Arg Lys Glu He Ser 
1105 1110 1115 1120 

GTG GTT GAG CAA AAA CCT TTA TTA TTC AAT GGA ACC ATC AGA GAT AAC 3408 
Val Val Glu Gin Lys Pro Leu Leu Phe Asn Gly Thr He Arg Asp Asn 
1125 H30 1135 

CTA ACT TAT GGT TTA CAA GAT GAA ATA CTT GAA ATT GAA ATG TAT GAT 3456 
Leu Thr Tyr Gly Leu Gin Asp Glu He Leu Glu He Glu Met Tyr Asp 
1140 H45 1150 

GCA TTA AAA TAC GTA GGA ATC CAT GAC TTT GTA ATT TCA TCA CCT CAG 3504 
Ala Leu Lys Tyr Val Gly He His Asp Phe Val He Ser Ser Pro Gin 
1155 H60 H65 

GGC TTG GAT ACA CGT ATT GAT ACA ACT TTA CTG TCA GGT GGA CAA GCG 3552 
Gly Leu Asp Thr Arg He Asp Thr Thr Leu Leu Ser Gly Gly Gin Ala 
1170 H75 1180 

CAA AGG CTT TGC ATA GCC AGA GCA CTT CTG AGA AAA TCA AAA ATT CTG 3600 
Gin Arg Leu Cys He Ala Arg Ala Leu Leu Arg Lys Ser Lys He Leu 
1185 ' 1190 H95 1200 

ATT TTA GAT GAG TGT ACT TCA GCC TTG GAT TCT GTC AGC TCC TCT ATC 3648 
He Leu Asp Glu Cys Thr Ser Ala Leu Asp Ser Val Ser Ser Ser He 
1205 1210 1215 

ATC AAT GAG ATC GTC AAA AAA GGT CCA CCT GCT CTA CTA ACA ATG GTT 3696 
He Asn Glu He Val Lys Lys Gly Pro Pro Ala Leu Leu Thr Met Val 
1220 " 1225 1230 
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ATA ACG CAT AGT GAA CAA ATG ATG AGG TCT TGT AAC TCG ATT GCA GTT 3744 
lie Thr His Ser Glu Gin Met Met Arg Ser Cys Asn Ser lie Ala Val 
1235 I 240 1245 

CTT AAA GAT GGT AAA GTG GTT GAG CGA GGT AAC TTC GAC ACT TTA TAT 3792 
SS ^ Asp My Lys Val Val Glu Arg Gly Asn Phe Asp Thr Leu Tyr 
1250 1255 1260 

AAT AAT CGC GGG GAA TTA TTC CAA ATT GTT TCC AAC CAA AGC AGT TAA 3840 
Kn £» Arg Gly Glu Leu Phe Gin He Val Ser Asn Gin Ser Ser * 
1265 1270 12 
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H5 Chimera (Seq. ID No. 3) 

ATG AAC TTT TTA AGT TTT AAG ACT ACA AAA CAC TAT CAC ATT TTC AGG 48 
Met Asn Phe Leu Ser Phe Lys Thr Thr Lys His Tyr His He Phe Arg 
! 5 10 15 

TAC GTG AAC ATA CGG AAT GAC TAC AGG CTG TTA ATG ATA ATG. ATA ATA 96 
Tyr Val Asn He Arg Asn Asp Tyr Arg Leu Leu Met He Met He He 
20 25 30 

GGT ACC GTG GCA ACA GGC CTA GTG CCG GCA ATT ACT TCT ATC CTG ACG 144 
Gly Thr Val Ala Thr Gly Leu Val Pro Ala He Thr Ser He Leu Thr 
35 40 45 

GGC AGA GTG TTC GAT CTA CTA TCA GTT TTC GTG GCT AAT GGG TCA CAT 192 
Gly Arg Val Phe Asp Leu Leu Ser Val Phe Val Ala Asn Gly Ser His 
50 55 60 

CAA GGT TTG TAT TCC CAA CTA GTA CAG AGG TCA ATG GCA GTA ATG GCA 240 
Gin Gly Leu Tyr Ser Gin Leu Val Gin Arg Ser Met Ala Val Met Ala 
65 70 75 80 

CTT GGT GCG GCT TCT GTG CCA GTA ATG TGG CTT TCT CTA ACA AGT TGG 288 
Leu Gly Ala Ala Ser Val Pro Val Met Trp Leu Ser Leu Thr Ser Trp 
85 90 95 

ATG CAC ATC GGC GAG AGA CAA GGC TTT AGA ATA CGG TCA CAG ATA TTG 336 
Met His He Gly Glu Arg Gin Gly Phe Arg He Arg Ser Gin He Leu 
100 105 HO 

GAG GCA TAT TTG GAG GAA AAG CCA ATG GAA TGG TAC GAC AAT AAT GAA 384 
Glu Ala Tyr Leu Glu Glu Lys Pro Met Glu Trp Tyr Asp Asn Asn Glu 
115 120 125 

AAA TTG TTA GGA GAT TTT ACT CAA ATC AAC AGA TGT GTG GAA GAG CTA 4 32 

Lys Leu Leu Gly Asp Phe Thr Gin He Asn Arg Cys Val Glu Glu Leu 
130 135 140 

AGA TCA AGC TCC GCA GAG GCA TCA GCC ATA ACT TTC CAG AAT TTA GTT 480 
Arg Ser Ser Ser Ala Glu Ala Ser Ala He Thr Phe Gin Asn Leu Val 
145 150 155 160 

GCA ATA TGT GCG CTT CTG GGG ACG TCA TTC TAC TAT TCT TGG TCA TTA 528 
Ala He Cys Ala Leu Leu Gly Thr Ser Phe Tyr Tyr Ser Trp Ser Leu 
165 170 175 

ACT TTA ATT ATT CTT TGC AGC TCT CCA ATA ATC ACA TTT TTT GCA GTG 576 
Thr Leu He He Leu Cys Ser Ser Pro He He Thr Phe Phe Ala Val 
180 " 185 190 
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GTG TTT TCC AG A ATG ATT CAT GTA TAT TCA GAG AAG GAG AAT TCT GAA 624 
val Phe Ser Arg Met lie His Val Tyr Ser Glu Lys Glu Asn Ser Glu 
195 200 205 

ACG AGT AAA GCA GCC CAA TTA CTT ACA TGG TCG ATG AAT GCC GCT CAA 672 
Thr Ser Lys Ala Ala Gin Leu Leu Thr Trp Ser Met Asn Ala Ala Gin 
210 215 220 

TTA GTG AG A TTA TAT TGT ACA CAA CGT CTA GAA AGG AAA AAA TTC AAG 720 
Leu Val Arg Leu Tyr Cys Thr Gin Arg Leu Glu Arg Lys Lys Phe Lys 
225 230 235 240 

GAA ATC ATA CTA AAT TGT AAC ACT TTC TTC ATC AAG AGT TGC TTT TTT 768 
Glu He He Leu Asn Cys Asn Thr Phe Phe He Lys Ser Cys Phe Phe 
245 250 255 

GTT GCT GCA AAC GCT GGG ATC TTG AGA TTT TTG ACG TTG ACT ATG TTT 816 
Val Ala Ala Asn Ala Gly He Leu Arg Phe Leu Thr Leu Thr Met Phe 
260 265 270 

GTA CAG GGA TTC TGG TTT GGT TCC GCA ATG ATC AAA AAG GGC AAG CTG 864 
Val Gin Gly Phe Trp Phe Gly Ser Ala Met He Lys Lys Gly Lys Leu 

275 280 285 

AAC ATT AAC GAT GTA ATC ACT TGC TTC CAT TCA TGC ATT ATG CTG GGC 912 
Asn He Asn Asp Val He Thr Cys Phe His Ser Cys He Met Leu Gly 

290 295 300 

TCG ACT TTA AAT AAT ACA TTA CAC CAA ATA GTT GTT CTT CAA AAA GGC 960 
Ser Thr Leu Asn Asn Thr Leu His Gin He Val Val Leu Gin Lys Gly 
305 310 315 320 

GGA GTG GCT ATG GAA AAA ATC ATG ACT CTA TTA AAA GAT GGA TCC AAG 1008 
Gly Val Ala Met Glu Lys He Met Thr Leu Leu Lys Asp Gly Ser Lys 
325 330 335 

CGA AAT CCT TTA AAT AAA ACT GTA GCC CAC CAA TTT CCA CTA GAT TAT 1056 
Arg Asn Pro Leu Asn Lys Thr Val Ala His Gin Phe Pro Leu Asp Tyr 
340 ^ 345 350 

GCC ACC AGT GAT CTA ACA TTT GCT AAT GTT TCG TTT TCT TAT CCA AGC 1104 
Ala Thr Ser Asp Leu Thr Phe Ala Asn Val Ser Phe Ser Tyr Pro Ser 

355 * 360 365 

AGA CCT TCG GAA GCA GTT TTA AAG AAC GTT AGT TTA AAT TTC TCT GCA 1152 
Ara Pro Ser Glu Ala Val Leu Lys Asn Val Ser Leu Asn Phe Ser Ala 

370 375 380 

GGA CAA TTT ACT TTC ATA GTA GGA AAA TCA GGC TCA GGT AAA TCT ACA 1200 
Gly Gin Phe Thr Phe He Val Gly Lys Ser Gly Ser Gly Lys Ser Thr 
385 390 395 400 
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TTA TCC AAC TTA TTA TTA AGG TTC TAC GAT GGC TAT AAT GGA TCG ATA 
Ser Asn Leu Leu Leu Arg Phe Tyr Asp Gly Tyr Asn Gly Ser He 
405 410 

TCT ATC AAT GGC CAC AAT ATC CAA ACA ATC GAC CAA AAA TTG CTA ATT 
s2 S A^n Gly His Asn He Gin Thr He Asp Gin Lys Leu Leu He 

420 425 



GAA AAT ATC ACC GTC GTA GAA 
Glu Asn lie Thr Val Val Glu 
435 



STE6 



CAG TTT TCC TGG ATT ATG CCT GGC ACC 
Gin Phe Ser Trp He Met Pro Gly Thr 
440 445 

1 cm 



ATT AAA GAA AAT ATC ATC TTT GGT GTT TCC TAT GAT GAA TAT AGA TAC 
S £S Si £!n lie lie Phe Gly Val Ser Tyr Asp Glu Tyr Arg Tyr 
450 455 460 

ir , Arr GTC ATC aaa GCA TGC CAA CTA GAA GAG GAC ATC TCC AAG TTT 
£g sir SS lie Si S3 cys Gin Leu Glu Glu Asp He Ser Lys Phe 
465 470 475 

GCA GAG AAA GAC AAT ATA GTT CTT GGA GAA GGT GGA ATC ACA CTG AGT 
111 Su iji Sp Asn lie Val Leu Gly Glu Gly Gly He Thr L*u Ser 



GGA GGT CAA CGA GCA AGA ATT 
Gly Gly Gin Arg Ala Arg He 
500 



TCT TTA 
Ser Leu 
505 



GCA 
Ala 



AGA | GCA 
Arg Ala 

CFTR — I— STE6 



TTC 
Phe 



ATC AGA 
He Arg 
510 



GAT 
Asp 



ACT CCA ATA TTA TTC TTA GAC GAA GCT GTA TCG GCT CTA GAT ATT GTT 
JS S?i l2 Su £S £eu Asp Glu Ala Val Ser Ala Leu Asp He Val 
515 520 525 

CAT CGC AAC CTG TTG ATG AAG GCA ATT AGG CAT TGG AGG AAA GGA AAG 
Sis Arg Asn Leu Leu Met Lys Ala He Arg His Trp Arg Lys Gly Lys 
530 535 540 

ACT ACA ATC ATA TTG ACG CAT GAG TTG AGC CAA ATT GAA TCT GAT GAC 
Thr Thr lie lie Leu Thr His Glu Leu Ser Gin He Glu Ser Asp Asp 

545 550 555 560 

TAT TTA TAT TTA ATG AAG GAA GGT GAA GTT GTT GAA AGC GGC ACC CAG 
5? Leu Tyr Leu Met Lys Glu Gly Glu Val Val Glu Ser Gly Thr Gin 
565 570 575 

TCT GAA CTT CTA GCC GAT CCG ACC ACT ACA TTT AGC ACA TGG TAT CAC 
sS Si £S SS Ala Asp Pro Thr Thr Thr Phe Ser Thr Trp Tyr His 
5S0 585 590 

~™ r**r rir TAC TCT GAT GCG AAA ACT ATT GTA GAT ACA GAG ACT 

Su SS £n Asp T^r sS Asp Ala Lys Thr He Val Asp Thr Glu Thr 

595 600 0 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 
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t% Si Si IS S SS £ 51? Si S S *| IS Si S SI 

610 ' 615 620 

ACA CCA AAA CTT GGA TCA TGC TTA ACT AAT CTG GGA TAT GAT GAG ACA 
Thr Pro Lys Leu Gly Ser Cys Leu Ser Asn Leu Gly Tyr Asp Giu inr 
625 630 63i 

GAT CAG TTG TCC TTT TAC GAA GCA ATC TAT CAA AAA AGA TCG AAC GOT 
Asp SS Su Ser Phe Tyr Glu Ala lie Tyr Gin Lys Arg Ser Asn Val 



645 



AGA ACA AGA AGG GTT AAA GTT GAA GAG GAA AAT ATT GGG TAT GCA CTA 
Arg iS £g Jrg Val Lys Val Glu Glu Glu Asn lie Gly Tyr Ala Leu 
660 665 

AAA CAA CAA AAG AAC ACC GAA AGT TCA ACA GGG CCA CAA CTT CTG AGC 
Lys Gin Gin Lys Asn Thr Glu Ser Ser Thr Gly Pro Gin Leu Leu ser 
675 680 

ATT ATT CAG ATT ATC AAA AGA ATG ATT AAA AGC ATA AGA TAC AAA AAA 
iTe S Gin lie lie Lys Arg Met lie Lys Ser lie Arg Tyr Lys Lys 

690 • 695 700 

ATT CTA ATC TTG GGA CTG CTA TGT TCT CTT ATC GCA GGC GCC ACA AAT 
lie Leu lie Leu Gly Leu Leu Cys Ser Leu lie Ala Gly Ala Tnr Asn 
705 710 

CCC GTC TTT TCA TAC ACA TTC AGT TTC TTA CTA GAA GGA ATT GTC CCA 
Pro val Phe Ser Tyr Thr Phe Ser Phe Leu Leu Glu Gly lie Val Pro 

725 730 

TCC ACG GAT GGA AAA ACT GGC TCT TCA CAT TAT TTG GCG AAA TGG TCG 
ser Thr Asp Gly Lys Thr Gly Ser Ser His Tyr Leu Ala Lys Trp Ser 
740 745 

CTT CTT GTT CTT GGT GTG GCT GCG GCA GAT GGT ATT TTC AAT TTT GCT 
Leu Leu Val Leu Gly Val Ala Ala Ala Asp Gly lie Phe Asn Phe Ala 

755 760 765 

si s s si si s ss I?* s as s s - s s s 

770 775 

AGA AAT GAA GTT ATG GAA AAA CTG ACG AGA AAG AAT ATG GAC TGG TTT 
£g £S gS val Set Glu Lys Leu Thr Arg Lys Asn Met Asp Trp Phe 
785 790 

_ rrT TCT GAA A TT TCT GCT CTA GTC TTG AAT 

TCT GGT GAA AAC AAC AAG GCT TCT GAA Ail ±yx v> 

Ser Gly Glu Asn Asn Lys Ala Ser Glu lie Ser Ala Leu Val Leu Asn 

805 8 1 0 



1872 
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2016 
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GAT TTG CGA GAT TTG AGG TCT TTG GTC TCT GAA TTT TTG AGT GCA ATG 2496 
Asp Leu Arg Asp Leu Arg Ser Leu Val Ser Glu Phe Leu Ser Ala Met 
820 825 830 

ACT AGT TTC GTT ACC GTA TCA ACG ATT GGA CTA ATT TGG GCG TTA GTA 2544 
Thr Ser Phe Val Thr Val Ser Thr He Gly Leu He Trp Ala Leu Val 
835 840 845 

TCG GGC TGG AAG TTA AGT TTG GTT TGT ATT TCG ATG TTT CCA CTC ATA 2592 
Ser Gly Trp Lys Leu Ser Leu Val Cys He Ser Met Phe Pro Leu He 
850 855 860 

ATT ATA TTT TCA GCA ATA TAT GGA GGC ATT TTA CAA AAA TGC GAA ACA 2640 
lie He Phe Ser Ala He Tyr Gly Gly He Leu Gin Lys Cys Glu Thr 
865 870 ^ 875 880 

GAT TAT AAG ACA TCT GTT GCT CAG TTA GAA AAC TGC CTG TAC CAG ATT 2688 
Asp Tyr Lys Thr Ser Val Ala Gin Leu Glu Asn Cys Leu Tyr Gin He 
885- 890 895 

GTC ACT AAC ATT AAA ACC ATT AAG TGC TTA CAA GCT GAA TTT CAT TTT 2736 
Val Thr Asn He Lys Thr He Lys Cys Leu Gin Ala Glu Phe His Phe 
900 905 910 

CAA TTG ACC TAC CAT GAC TTG AAG ATA AAA ATG CAA CAA ATT GCC TCC 2784 
Gin Leu Thr Tyr His Asp Leu Lys He Lys Met Gin Gin He Ala Ser 
915 920 925 

AAA CGC GCC ATT GCC ACA GGA TTT GGT ATA TCT ATG ACA AAC ATG ATT 2832 
Lys Arg Ala He Ala Thr Gly Phe Gly He Ser Met Thr Asn Met He 
930 935 940 

GTC ATG TGT ATC CAA GCT ATT ATT TAC TAC TAT GGC CTA AAG CTG GTT 2880 
Val Met cys He Gin Ala He He Tyr Tyr Tyr Gly Leu Lys Leu Val 
945 950 955 960 

ATG ATT CAC GAG TAC ACC TCA AAG GAA ATG TTT. ACG ACT TTC ACT TTG 2928 
Met He His Glu Tyr Thr Ser Lys Glu Met Phe Thr Thr Phe Thr Leu 
965 970 975 

TTA TTA TTC ACT ATT ATG TCA TGC ACT AGC CTA GTA AGT CAG ATA CCC 2976 
Leu Leu Phe Thr He Met Ser Cys Thr Ser Leu Val Ser Gin He Pro 
980 985 990 

GAT ATA AGT AGA GGC CAA CGT GCT GCC AGT TGG ATC TAT AGG ATT CTT 3024 
Asp He Ser Arg Gly Gin Arg Ala Ala Ser Trp He Tyr Arg He Leu 
995 * 1000 1005 

GAT GAA AAG CAT AAT ACC CTA GAG GTT GAA AAC AAT AAT GCT AGA ACA 3072 
Asp Glu Lys His Asn Thr Leu Glu Val Glu Asn Asn Asn Ala Arg Thr 
1010 1015 1020 
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GTG GGA ATA GCT GGT CAC ACC TAC CAT GGC AAA GAA AAA AAA CCA ATC 3120 
Val Gly He Ala Gly His Thr Tyr His Gly Lys Glu Lys Lys Pro He 
1025 1030 1035 1040 

GTT TCA ATT CAA AAT TTG ACA TTT GCC TAT CCA TCT GCA CCT ACC GCC 3168 
Val Ser He Gin Asn Leu Thr Phe Ala Tyr Pro Ser Ala Pro Thr Ala 
1045 1050 1055 

TTT GTT TAC AAA AAC ATG AAT TTT GAC ATG TTT TGC GGA CAG ACG TTA 3216 
Phe Val Tyr Lys Asn Met Asn Phe Asp Met Phe Cys Gly Gin Thr Leu 
1060 1065 1070 

GGT ATC ATT GGT GAA TCA GGC ACA GGA AAG TCT ACA CTT GTG CTT TTA 3264 
Gly He He Gly Glu Ser Gly Thr Gly Lys Ser Thr Leu Val Leu Leu 
1075 1080 1085 

TTA ACA AAA CTT TAT AAT TGT GAA GTA GGC AAA ATT AAA ATA GAC GGT 3312 
Leu Thr Lys Leu Tyr Asn Cys Glu Val Gly Lys He Lys He Asp Gly 
1090 1095 HOO 

ACG GAC GTA AAT GAC TGG AAT TTG ACA AGT TTA AGA AAA GAA ATT TCA 3360 
Thr Asp Val Asn Asp Trp Asn Leu Thr Ser Leu Arg Lys Glu He Ser 
1105 1110 1115 1120 

GTG GTT GAG CAA AAA CCT TTA TTA TTC AAT GGA ACC ATC AGA GAT AAC 3408 
Val Val Glu Gin Lys Pro Leu Leu Phe Asn Gly Thr He Arg Asp Asn 
1125 1130 1135 

CTA ACT TAT GGT TTA CAA GAT GAA ATA CTT GAA ATT GAA ATG TAT GAT 34 56 

Leu Thr Tyr Gly Leu Gin Asp Glu He Leu Glu He Glu Met Tyr Asp 
1140 1145 1150 

GCA TTA AAA TAC GTA GGA ATC CAT GAC TTT GTA ATT TCA TCA CCT CAG 3504 
Ala Leu Lys Tyr Val Gly lie His Asp Phe Val He Ser Ser Pro Gin 
1155 ~ 1160 1165 

GGC TTG GAT ACA CGT ATT GAT ACA ACT TTA CTG TCA GGT GGA CAA GCG 3552 
Gly Leu Asp Thr Arg He Asp Thr Thr Leu Leu Ser Gly Gly Gin Ala 
1170 1175 1180 

CAA AGG CTT TGC ATA GCC AGA GCA CTT CTG AGA AAA TCA AAA ATT CTG 3600 
Gin Arg Leu Cys He Ala Arg Ala Leu Leu Arg Lys Ser Lys He Leu 
1185 " 1190 1195 1200 

ATT TTA GAT GAG TGT ACT TCA GCC TTG GAT TCT GTC AGC TCC TCT ATC 3648 
He Leu Asp Glu Cys Thr Ser Ala Leu Asp Ser Val Ser Ser Ser He 
1205 1210 1215 

ATC AAT GAG ATC GTC AAA AAA GGT CCA CCT GCT CTA CTA ACA ATG GTT 3696 
He Asn Glu He Val Lys Lys Gly Pro Pro Ala Leu Leu Thr Met Val 
1220 1225 1230 
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ATA ACG CAT AGT GAA CAA ATG ATG AGG TCT TGT AAC TCG ATT GCA GTT 3744 
lie Thr His Ser Glu Gin Met Met Arg Ser Cys Asn Ser He Ala Val 
1235 1240 1245 

CTT AAA GAT GGT AAA GTG GTT GAG CGA GGT AAC TTC GAC ACT TTA TAT 3792 
Leu Lys Asp Gly Lys Val Val Glu Arg Gly Asn Phe Asp Thr Leu Tyr 
1250 " ' 1255 1260 

AAT AAT CGC GGG GAA TTA TTC CAA ATT GTT TCC AAC CAA AGC AGT TAA 3840 
Asn Asn Arg Gly Glu Leu Phe Gin He Val Ser Asn Gin Ser Ser * 
1265 ' * 1270 1275 1280 
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H6 Chimera (Seq. ID No. 5) 

ATG AAC TTT TTA AGT TTT AAG ACT ACA AAA CAC TAT CAC ATT TTC AGG 4 8 

Met Asn Phe Leu Ser Phe Lys Thr Thr Lys His Tyr His He Phe Arg 
1 5 10 15 

TAC GTG AAC ATA CGG AAT GAC TAC AGG CTG TTA ATG ATA ATG. ATA ATA 96 
Tyr Val Asn He Arg Asn Asp Tyr Arg Leu Leu Met He Met He He 
20 25 30 

GGT ACC GTG GCA CTA TCA GTT TTC GTG GCT AAT GGG TCA CAT CAA GGT 144 
Glv Thr Val Ala Leu Ser Val Phe Val Ala Asn Gly Ser His Gin Gly 
35 40 45 

TTG TAT TCC CAA CTA GTA CAG AGG TCA ATG GCA GTA ATG GCA CTT GGT 192 
Leu Tyr Ser Gin Leu Val Gin Arg Ser Met Ala Val Met Ala Leu Gly 
50 55 60 

GCG GCT TCT GTG CCA GTA ATG TGG CTT TCT CTA ACA AGT TGG ATG CAC 240 
Ala Ala Ser Val Pro Val Met Trp Leu Ser Leu Thr Ser Trp Met His 
65 70 75 80 

ATC GGC GAG AGA CAA GGC TTT AGA ATA CGG TCA CAG ATA TTG GAG GCA 288 
He Gly Glu Arg Gin Gly Phe Arg He Arg Ser Gin He Leu Glu Ala 
85 90 95 

TAT TTG GAG GAA AAG CCA ATG GAA TGG TAC GAC AAT AAT GAA AAA TTG 336 
Tyr Leu Glu Glu Lys Pro Met Glu Trp Tyr Asp Asn Asn Glu Lys Leu 
100 105 HO 

TTA GGA GAT TTT ACT CAA ATC AAC AGA TGT GTG GAA GAG CTA AGA TCA 384 
Leu Gly Asp Phe Thr Gin He Asn Arg Cys Val Glu Glu Leu Arg Ser 
115 120 125 

AGC TCC GCA GAG GCA TCA GCC ATA ACT TTC CAG AAT TTA GTT GCA ATA 432 
Ser Ser Ala Glu Ala Ser Ala He Thr Phe Gin Asn Leu Val Ala He 
130 135 140 

TGT GCG CTT CTG GGG ACG TCA TTC TAC TAT TCT TGG TCA TTA ACT TTA 480 
Cys Ala Leu Leu Gly Thr. Ser Phe Tyr Tyr Ser Trp Ser Leu Thr Leu 
145 150 155 160 

ATT ATT CTT TGC AGC TCT CCA ATA ATC ACA TTT TTT GCA GTG GTG TTT 528 
He He Leu Cys Ser Ser Pro He He Thr Phe Phe Ala Val Val Phe 
165 170 175 

TCC AGA ATG ATT CAT GTA TAT TCA GAG AAG GAG AAT TCT GAA ACG AGT 576 
Ser Ara Met He His Val Tyr Ser Glu Lys Glu Asn Ser Glu Thr Ser 
180 185 1^0 
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AAA GCA GCC CAA TTA CTT ACA TGG TCG ATG AAT GCC GCT CAA TTA GTG 624 

Lys Ala Ala Gin Leu Leu Thr Trp Ser Met Asn Ala Ala Gin Leu Val 
195 200 205 

AGA TTA TAT TGT ACA CAA CGT CTA GAA AGG AAA AAA TTC AAG GAA ATC 672 

Arg Leu Tyr Cys Thr Gin Afg Leu Glu Arg Lys Lys Phe Lys Glu lie 
210 215 220 

ATA CTA AAT TGT AAC ACT TTC TTC ATC AAG AGT TGC TTT TTT GTT GCT 720 

He Leu Asn Cys Asn Thr Phe Phe He Lys Ser Cys Phe Phe Val Ala 

225 - 230 235 240 

GCA AAC GCT GGG ATC TTG AGA TTT TTG ACG TTG ACT ATG TTT GTA CAG 768 

Ala Asn Ala Gly lie Leu Arg Phe Leu Thr Leu Thr Met Phe Val Gin 
245 250 255 

GGA TTC TGG TTT GGT TCC GCA ATG ATC AAA AAG GGC AAG CTG AAC ATT 816 

Gly Phe Trp Phe Gly Ser Ala Met He Lys Lys Gly Lys Leu Asn He 
260 265 270 

AAC GAT GTA ATC ACT TGC TTC CAT TCA TGC ATT ATG CTG GGC TCG ACT 864 

Asn Asp Val He Thr Cys Phe His Ser Cys He Met Leu Gly Ser Thr 
275 280 285 

TTA AAT AAT ACA TTA CAC CAA ATA GTT GTT CTT CAA AAA GGC GGA GTG 912 

Leu Asn Asn Thr Leu His Gin He Val Val Leu Gin Lys Gly Gly Val 
290 295 300 

GCT ATG GAA AAA ATC ATG ACT CTA TTA AAA GAT GGA TCC AAG CGA AAT 960 

Ala Met Glu Lys lie Met Thr Leu Leu Lys Asp Gly Ser Lys Arg Asn 

305 " 310 315 320 

CCT TTA AAT AAA ACT GTA GCC CAC CAA TTT CCA CTA GAT TAT GCC ACC 1008 

Pro Leu Asn Lys Thr Val Ala His Gin Phe Pro Leu Asp Tyr Ala Thr 

325 330 335 

AGT GAT CTA ACA TTT GCT AAT GTT TCG TTT TCT TAT CCA AGC AGA CCT 1056 

Ser Asp Leu Thr Phe Ala Asn Val Ser Phe Ser Tyr Pro Ser Arg Pro 
340 345 350 

TCG GAA GCA GTT TTA AAG AAC GTT AGT TTA AAT TTC TCT GCA GGA CAA 1104 

Ser Glu Ala Val Leu Lys Asn Val Ser Leu Asn Phe Ser Ala Gly Gin 
355 360 365 

TTT ACT TTC ATA GTA GGA AAA TCA GGC TCA GGT AAA TCT ACA TTA TCC 1152 

Phe Thr Phe He Val Gly Lys Ser Gly Ser Gly Lys Ser Thr Leu Ser 

370 375 380 

AAC TTA TTA TTA AGG TTC TAC GAT GGC TAT AAT GGA TCG ATA TCT ATC 1200 

Asn Leu Leu Leu Arg Phe Tyr Asp Gly Tyr Asn Gly Ser He Ser He 

385 390 395 400 
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AAT GGC CAC AAT ATC CAA ACA ATC GAC CAA AAA TTG CTA ATT GAA AAT 1248 
Asn Gly His Asn lie Gin Thr lie Asp Gin Lys Leu Leu He Glu Asn 
405 410 415 



ATC ACC GTC GTA GAA 
He Thr Val Val Glu 
420 
STE6 



CAG TTT TCC TGG ATT ATG OCT GGC ACC ATT AAA 1296 

Gin Phe Ser Trp He Met Pro Gly Thr He Lys 
425 430 

CFTR 



GAA AAT ATC ATC TTT GGT GTT TCC TAT GAT GAA TAT AGA TAC "AGA AGC 1344 
Glu Asn He He Phe Gly Val Ser Tyr Asp Glu Tyr Arg Tyr Arg Ser 
435 440 445 

GTC ATC AAA GCA TGC CAA CTA GAA GAG GAC ATC TCC AAG TTT GCA GAG 1392 
Val He Lys Ala Cys Gin Leu Glu Glu Asp He Ser Lys Phe Ala Glu 
450 455 460 

AAA GAC AAT ATA GTT CTT GGA GAA GGT GGA ATC ACA CTG AGT GGA GGT 1440 
Lys Asp Asn He Val Leu Gly Glu Gly Gly He Thr Leu Ser Gly Gly 
465 470 475 480 

CAA CGA GCA AGA ATT TCT TTA GCA AGA GCA GTA TAC AAA GAT GCT GAT 1488 
Gin Arg Ala Arg He Ser Leu Ala Arg Ala Val Tyr Lys Asp Ala Asp 
485 490 495 

TTG TAT TTA TTA GAC TCT CCT TTT GGA TAC CTA GAT| ATT GTT CAT CGC 1536 
Leu Tyr Leu Leu Asp Ser Pro Phe Gly Tyr Leu Aspl He Val His Arg 

cm J- srl£° 

AAC CTG TTG ATG AAG GCA ATT AGG CAT TGG AGG AAA GGA AAG ACT ACA 1584 
Asn Leu Leu Met Lys Ala He Arg His Trp Arg Lys Gly Lys Thr Thr 
515 520 525 

ATC ATA TTG ACG CAT GAG TTG AGC CAA ATT GAA TCT GAT GAC TAT TTA 1632 
He He Leu Thr His Glu Leu Ser Gin He Glu Ser Asp Asp Tyr Leu 
530 535 540 

TAT TTA ATG AAG GAA GGT GAA GTT GTT GAA AGC GGC ACC CAG TCT GAA 1680 
Tyr Leu Met Lys Glu Gly Glu Val val Glu Ser Gly Thr Gin Ser Glu 
545 550 555 560 

CTT CTA GCC GAT CCG ACC ACT ACA TTT AGC ACA TGG TAT CAC CTA CAG 1728 
Leu Leu Ala Asp Pro Thr Thr Thr Phe Ser Thr Trp Tyr His Leu Gin 
565 570 575 

AAT GAC TAC TCT GAT GCG AAA ACT ATT GTA GAT ACA GAG ACT GAA GAA 1776 
Asn Asp Tyr Ser Asp Ala Lys Thr He Val Asp Thr Glu Thr Glu Glu 
580 * 585 590 

AAA TCT ATA CAC ACT GTG GAA AGT TTT AAC TCT CAA TTG GAA ACA CCA 1824 
Lys Ser He His Thr Val Glu Ser Phe Asn Ser Gin Leu Glu Thr Pro 
595 600 605 
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AAA CTT GGA TCA TGC TTA AGT AAT CTG GGA TAT GAT GAG ACA GAT CAG 1872 
Lys Leu Gly Ser Cys Leu Ser Asn Leu Gly Tyr Asp Glu Thr Asp Gin 
610 615 620 

TTG TCC TTT TAC GAA GCA ATC TAT CAA AAA AGA TCG AAC GTT AGA ACA 1920 
Leu Ser Phe Tyr Glu Ala lie Tyr Gin Lys Arg Ser Asn Val Arg Thr 
625 630 635 640 

AGA AGG GTT AAA GTT GAA GAG GAA AAT ATT GGG TAT GCA CTA" AAA CAA 1968 
Arg Arg Val Lys Val Glu Glu Glu Asn lie Gly Tyr Ala Leu Lys Gin 
645 650 655 

CAA AAG AAC ACC GAA AGT TCA ACA GGG CCA CAA CTT CTG AGC ATT ATT 2016 
Gin Lys Asn Thr Glu Ser Ser Thr Gly Pro Gin Leu Leu Ser lie lie 
660 665 670 

CAG ATT ATC AAA AGA ATG ATT AAA AGC ATA AGA TAC AAA AAA ATT CTA 2064 
Gin lie lie Lys Arg Met lie Lys Ser He Arg Tyr Lys Lys He Leu 
675 680 685 

ATC TTG GGA CTG CTA TGT TCT CTT ATC GCA GGC GCC ACA AAT CCC GTC 2112 
He Leu Gly Leu Leu Cys Ser Leu He Ala Gly Ala Thr Asn Pro Val 
690 " 695 700 

TTT TCA TAC ACA TTC AGT TTC TTA CTA GAA GGA ATT GTC CCA TCC ACG 2160 
Phe Ser Tyr Thr Phe Ser Phe Leu Leu Glu Gly He Val Pro Ser Thr 
705 710 715 720 

GAT GGA AAA ACT GGC TCT TCA CAT TAT TTG GCG AAA TGG TCG CTT CTT 2208 
Asp Gly Lys Thr Gly Ser Ser His Tyr Leu Ala Lys Trp Ser Leu Leu 
725 730 735 

GTT CTT GGT GTG GCT GCG GCA GAT GGT ATT TTC AAT TTT GCT AAA GGA 2256 
Val Leu Gly Val Ala Ala Ala Asp Gly He Phe Asn Phe Ala Lys Gly 
740 745 750 

TTC CTA TTA GAT TGC TGC AGT GAA TAC TGG GTT ATG GAT CTT AGA AAT 2304 
Phe Leu Leu Asp Cys Cys Ser Glu Tyr Trp Val Met Asp Leu Arg Asn 
755 ^ 760 765 

GAA GTT ATG GAA AAA CTG ACG AGA AAG AAT ATG GAC TGG TTT TCT GGT 2352 
Glu Val Met Glu Lys Leu Thr Arg Lys Asn Met Asp Trp Phe Ser Gly 
770 775 780 

GAA AAC AAC AAG GCT TCT GAA ATT TCT GCT CTA GTC TTG AAT GAT TTG 2400 
Glu Asn Asn Lys Ala Ser Glu He Ser Ala Leu Val Leu Asn Asp Leu 
785 790 795 800 

CGA GAT TTG AGG TCT TTG GTC TCT GAA TTT TTG AGT GCA ATG ACT AGT 2448 
Arg Asp Leu Arg Ser Leu Val Ser Glu Phe Leu Ser Ala Met Thr Ser 
805 810 815 
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TTC GTT ACC GTA TCA ACG ATT GGA CTA ATT TGG GCG TTA GTA TCG GGC 24 9 6 

Phe Val Thr Val Ser Thr He Gly Leu He Trp Ala Leu Val Ser Gly 
B20 825 830 

TGG AAG TTA AGT TTG GTT TGT ATT TCG ATG TTT CCA CTC ATA ATT ATA 2544 

Trp Lys Leu Ser Leu Val Cys lie Ser Met Phe Pro Leu He He He 
835 840 845 

TTT TCA GCA ATA TAT GGA GGC ATT TTA CAA AAA TGC GAA ACA GAT TAT 2592 

Phe Ser Ala He Tyr Gly Gly He Leu Gin Lys Cys Glu Thr Asp Tyr 
850 * 855 860 

AAG ACA TCT GTT GOT CAG TTA GAA AAC TGC CTG TAG CAG ATT GTC ACT 2640 

Lys Thr Ser Val Ala Gin Leu Glu Asn Cys Leu Tyr Gin He Val Thr 
865 870 875 880 

AAC ATT AAA ACC ATT AAG TGC TTA CAA GCT GAA TTT CAT TTT CAA TTG 2688 

Asn He Lys Thr He Lys Cys Leu Gin Ala Glu Phe His Phe Gin Leu 

885 890 895 

ACC TAC CAT GAC TTG AAG ATA AAA ATG CAA CAA ATT GCC TCC AAA CGC 27 36 

Thr Tyr His Asp Leu Lys He Lys Met Gin Gin He Ala Ser Lys Arg 
900 905 910 

GCC ATT GCC ACA GGA TTT GGT ATA TCT ATG ACA AAC ATG ATT GTC ATG 2784 

Ala He Ala Thr Gly Phe Gly He Ser Met Thr Asn Met He Val Met 
915 920 925 

TGT ATC CAA GCT ATT ATT TAC TAC TAT GGC CTA AAG CTG GTT ATG ATT 2832 

Cys He Gin Ala He He Tyr Tyr Tyr Gly Leu Lys Leu Val Met He 
930 935 940 

CAC GAG TAC ACC TCA AAG GAA ATG TTT ACG ACT TTC ACT TTG TTA TTA 2880 

His Glu Tyr Thr Ser Lys Glu Met Phe Thr Thr Phe Thr Leu Leu Leu 
945 950 955 960 

TTC ACT ATT ATG TCA TGC ACT AGC CTA GTA AGT CAG ATA CCC GAT ATA 2928 

Phe Thr He Met Ser Cys Thr Ser Leu Val Ser Gin He Pro Asp He 

965 970 975 

AGT AGA GGC CAA CGT GCT GCC AGT TGG ATC TAT AGG ATT CTT GAT GAA 2976 

Ser Arg Gly Gin Arg Ala Ala Ser Trp He Tyr Arg He Leu Asp Glu 
980 985 990 

AAG CAT AAT ACC CTA GAG GTT GAA AAC AAT AAT GCT AGA ACA GTG GGA 3024 

Lys His Asn Thr Leu Glu Val Glu Asn Asn Asn Ala Arg Thr Val Gly 
995 1000 1005 

ATA GCT GGT CAC ACC TAC CAT GGC AAA GAA AAA AAA CCA ATC GTT TCA 3072 

He Ala Gly His Thr Tyr His Gly Lys Glu Lys Lys Pro He Val Ser 
1010 1015 1020 
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ATT CAA AAT TTG ACA TTT GCC TAT CCA TCT GCA CCT ACC GCC TTT GTT 312 0 

lie Gin Asn Leu Thr Phe Ala Tyr Pro Ser Ala Pro Thr Ala Phe Val 
1025 1030 1035 1040 

TAC AAA AAC ATG AAT TTT GAC ATG TTT TGC GGA CAG ACG TTA GGT ATC 3168 
Tyr Lys Asn Met Asn Phe Asp Met Phe Cys Gly Gin Thr Leu Gly He 
1045 1050 1055 

ATT GGT GAA TCA GGC ACA GGA AAG TCT ACA CTT GTG CTT TTA TTA ACA 3216 
He Gly Glu Ser Gly Thr Gly Lys Ser Thr Leu Val Leu Leu Leu Thr 
1060 1065 1070 

AAA CTT TAT AAT TGT GAA GTA GGC AAA ATT AAA ATA GAC GGT ACG GAC 3264 
Lys Leu Tyr Asn Cys Glu Val Gly Lys He Lys He Asp Gly Thr Asp 
1075 1080 1085 

GTA AAT GAC TGG AAT TTG ACA AGT TTA AGA AAA GAA ATT TCA GTG GTT 3312 
Val Asn Asp Trp Asn Leu Thr Ser Leu Arg Lys Glu He Ser Val Val 
1090 1095 1100 

GAG CAA AAA CCT TTA TTA TTC AAT GGA ACC ATC AGA GAT AAC CTA ACT 3360 
Glu Gin Lys Pro Leu Leu Phe Asn Gly Thr He Arg Asp Asn Leu Thr 
1105 mo 1115 1120 

TAT GGT TTA CAA GAT GAA ATA CTT GAA ATT GAA ATG TAT GAT GCA TTA 3408 
Tyr Gly Leu Gin Asp Glu He Leu Glu He Glu Met Tyr Asp Ala Leu 
1125 1130 1135 

AAA TAC GTA GGA ATC CAT GAC TTT GTA ATT TCA TCA CCT CAG GGC TTG 3456 
Lys Tyr Val Gly He His Asp Phe Val He Ser Ser Pro Gin Gly Leu 
1140 1145 1150 

GAT ACA CGT ATT GAT ACA ACT TTA CTG TCA GGT GGA CAA GCG CAA AGG 3 504 

Asp Thr Arg He Asp Thr Thr Leu Leu Ser Gly Gly Gin Ala Gin Arg 
1155 1160 1165 

CTT TGC ATA GCC AGA GCA CTT CTG AGA AAA TCA AAA ATT CTG ATT TTA 3552 
Leu Cys He Ala Arg Ala Leu Leu Arg Lys Ser Lys He Leu He Leu 
1170 1175 1180 

GAT GAG TGT ACT TCA GCC TTG GAT TCT GTC AGC TCC TCT ATC ATC AAT 3600 
Asp Glu Cys Thr Ser Ala Leu Asp Ser Val Ser Ser Ser He He Asn 
1185 1190 1195 1200 

GAG ATC GTC AAA AAA GGT CCA CCT GCT CTA CTA ACA ATG GTT ATA ACG 3 64 8 

Glu He Val Lys Lys Gly Pro Pro Ala Leu Leu Thr Met Val He Thr 
1205 1210 1215 

CAT AGT GAA CAA ATG ATG AGG TCT TGT AAC TCG ATT GCA GTT CTT AAA 3 696 

His Ser Glu Gin Met Met Arg Ser Cys Asn Ser He Ala Val Leu Lys 
1220 1225 1230 
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GAT GGT AAA GTG GTT GAG CGA GGT AAC TTC GAC ACT TTA TAT AAT AAT 3744 
Asp Gly Lys Val Val Glu Arg Gly Asn Phe Asp Thr Leu Tyr Asn Asn 
1235 1240 1245 

CGC GGG GAA TTA TTC CAA ATT GTT TCC AAC CAA AGC AGT TAA 3786 
Arg Gly Glu Leu Phe Gin He Val Ser Asn Gin Ser Ser * 
1250 1255 1260 
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